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Preface 


Welcome to University Physics, an OpenStax resource. This textbook was 
written to increase student access to high-quality learning materials, 
maintaining highest standards of academic rigor at little to no cost. 


About OpenStax 


OpenStax is a nonprofit based at Rice University, and it’s our mission to 
improve student access to education. Our first openly licensed college 
textbook was published in 2012 and our library has since scaled to over 25 
books used by hundreds of thousands of students across the globe. 
OpenStax Tutor, our low-cost personalized learning tool, is being used in 
college courses throughout the country. The OpenStax mission is made 
possible through the generous support of philanthropic foundations. 
Through these partnerships and with the help of additional low-cost 
resources from our OpenStax partners, OpenStax is breaking down the most 
common barriers to learning and empowering students and instructors to 
succeed. 


About OpenStax's resources 


Customization 


University Physics is licensed under a Creative Commons Attribution 4.0 
International (CC BY) license, which means that you can distribute, remix, 
and build upon the content, as long as you provide attribution to OpenStax 
and its content contributors. 


Because our books are openly licensed, you are free to use the entire book 
or pick and choose the sections that are most relevant to the needs of your 
course. Feel free to remix the content by assigning your students certain 
chapters and sections in your syllabus in the order that you prefer. You can 
even provide a direct link in your syllabus to the sections in the web view of 
your book. 


Instructors also have the option of creating a customized version of their 
OpenStax book. The custom version can be made available to students in 
low-cost print or digital form through their campus bookstore. Visit your 
book page on OpenStax.org for more information. 


Errata 


All OpenStax textbooks undergo a rigorous review process. However, like 
any professional-grade textbook, errors sometimes occur. Since our books 
are web based, we can make updates periodically when deemed 
pedagogically necessary. If you have a correction to suggest, submit it 
through the link on your book page on OpenStax.org. Subject matter 
experts review all errata suggestions. OpenStax is committed to remaining 
transparent about all updates, so you will also find a list of past errata 
changes on your book page on OpenStax.org. 


Format 


You can access this textbook for free in web view or PDF through 
OpenStax.org, and for a low cost in print. 


About University Physics 


University Physics is designed for the two- or three-semester calculus-based 
physics course. The text has been developed to meet the scope and 
sequence of most university physics courses and provides a foundation for a 
career in mathematics, science, or engineering. The book provides an 
important opportunity for students to learn the core concepts of physics and 
understand how those concepts apply to their lives and to the world around 
them. 


Due to the comprehensive nature of the material, we are offering the book 
in three volumes for flexibility and efficiency. 


Coverage and scope 


Our University Physics textbook adheres to the scope and sequence of most 
two- and three-semester physics courses nationwide. We have worked to 
make physics interesting and accessible to students while maintaining the 
mathematical rigor inherent in the subject. With this objective in mind, the 
content of this textbook has been developed and arranged to provide a 
logical progression from fundamental to more advanced concepts, building 
upon what students have already learned and emphasizing connections 
between topics and between theory and applications. The goal of each 
section is to enable students not just to recognize concepts, but to work with 
them in ways that will be useful in later courses and future careers. The 
organization and pedagogical features were developed and vetted with 
feedback from science educators dedicated to the project. 
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Pedagogical foundation 


Throughout University Physics you will find derivations of concepts that 
present classical ideas and techniques, as well as modern applications and 
methods. Most chapters start with observations or experiments that place 
the material in a context of physical experience. Presentations and 
explanations rely on years of classroom experience on the part of long-time 
physics professors, striving for a balance of clarity and rigor that has proven 
successful with their students. Throughout the text, links enable students to 
review earlier material and then return to the present discussion, reinforcing 
connections between topics. Key historical figures and experiments are 
discussed in the main text (rather than in boxes or sidebars), maintaining a 
focus on the development of physical intuition. Key ideas, definitions, and 
equations are highlighted in the text and listed in summary form at the end 
of each chapter. Examples and chapter-opening images often include 
contemporary applications from daily life or modern science and 
engineering that students can relate to, from smart phones to the internet to 
GPS devices. 


Assessments that reinforce key concepts 


In-chapter Examples generally follow a three-part format of Strategy, 
Solution, and Significance to emphasize how to approach a problem, how to 
work with the equations, and how to check and generalize the result. 
Examples are often followed by Check Your Understanding questions and 
answers to help reinforce for students the important ideas of the examples. 
Problem-Solving Strategies in each chapter break down methods of 


approaching various types of problems into steps students can follow for 
guidance. The book also includes exercises at the end of each chapter so 
students can practice what they’ ve learned. 


Conceptual questions do not require calculation but test student 
learning of the key concepts. 

Problems categorized by section test student problem-solving skills 
and the ability to apply ideas to practical situations. 

Additional Problems apply knowledge across the chapter, forcing 
students to identify what concepts and equations are appropriate for 
solving given problems. Randomly located throughout the problems 
are Unreasonable Results exercises that ask students to evaluate the 
answer to a problem and explain why it is not reasonable and what 
assumptions made might not be correct. 

Challenge Problems extend text ideas to interesting but difficult 
situations. 


Answers for selected exercises are available in an Answer Key at the end 
of the book. 


Additional resources 


Student and instructor resources 


We’ve compiled additional resources for both students and instructors, 
including Getting Started Guides, PowerPoint slides, and answer and 
solution guides for instructors and students. Instructor resources require a 
verified instructor account, which you can apply for when you log in or 
create your account on OpenStax.org. Take advantage of these resources to 
supplement your OpenStax book. 


Community Hubs 


OpenStax partners with the Institute for the Study of Knowledge 
Management in Education (ISKME) to offer Community Hubs on OER 


Commons — a platform for instructors to share community-created 
resources that support OpenStax books, free of charge. Through our 
Community Hubs, instructors can upload their own materials or download 
resources to use in their own courses, including additional ancillaries, 
teaching material, multimedia, and relevant course content. We encourage 
instructors to join the hubs for the subjects most relevant to your teaching 
and research as an opportunity both to enrich your courses and to engage 
with other faculty. 


To reach the Community Hubs, visit www.oercommons.org/hubs/OpenStax. 


Partner resources 


OpenStax partners are our allies in the mission to make high-quality 
learning materials affordable and accessible to students and instructors 
everywhere. Their tools integrate seamlessly with our OpenStax titles at a 
low cost. To access the partner resources for your text, visit your book page 
on OpenStax.org. 
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These 
snowshoers on 
Mount Hood in 

Oregon are 
enjoying the heat 
flow and light 
caused by high 
temperature. All 
three 
mechanisms of 
heat transfer are 
relevant to this 
picture. The heat 
flowing out of 
the fire also 
turns the solid 
snow to liquid 
water and vapor. 
(credit: 
modification of 
work by “Mt. 
Hood 
Territory”/Flickr 


) 


Heat and temperature are important concepts for each of us, every day. How 
we dress in the morning depends on whether the day is hot or cold, and 
most of what we do requires energy that ultimately comes from the Sun. 
The study of heat and temperature is part of an area of physics known as 
thermodynamics. The laws of thermodynamics govern the flow of energy 
throughout the universe. They are studied in all areas of science and 
engineering, from chemistry to biology to environmental science. 


In this chapter, we explore heat and temperature. It is not always easy to 
distinguish these terms. Heat is the flow of energy from one object to 
another. This flow of energy is caused by a difference in temperature. The 
transfer of heat can change temperature, as can work, another kind of 
energy transfer that is central to thermodynamics. We return to these basic 
ideas several times throughout the next four chapters, and you will see that 
they affect everything from the behavior of atoms and molecules to cooking 
to our weather on Earth to the life cycles of stars. 


Temperature and Thermal Equilibrium 
By the end of this section, you will be able to: 


e Define temperature and describe it qualitatively 
e Explain thermal equilibrium 
e Explain the zeroth law of thermodynamics 


Heat is familiar to all of us. We can feel heat entering our bodies from the 
summer Sun or from hot coffee or tea after a winter stroll. We can also feel 
heat leaving our bodies as we feel the chill of night or the cooling effect of 
sweat after exercise. 


What is heat? How do we define it and how is it related to temperature? 
What are the effects of heat and how does it flow from place to place? We 
will find that, in spite of the richness of the phenomena, a small set of 
underlying physical principles unites these subjects and ties them to other 
fields. We start by examining temperature and how to define and measure it. 


Temperature 


The concept of temperature has evolved from the common concepts of hot 
and cold. The scientific definition of temperature explains more than our 
senses of hot and cold. As you may have already learned, many physical 
quantities are defined solely in terms of how they are observed or measured, 
that is, they are defined operationally. Temperature is operationally 
defined as the quantity of what we measure with a thermometer. As we will 
see in detail in a later chapter on the kinetic theory of gases, temperature is 
proportional to the average kinetic energy of translation, a fact that provides 
a more physical definition. Differences in temperature maintain the transfer 
of heat, or heat transfer, throughout the universe. Heat transfer is the 
movement of energy from one place or material to another as a result of a 
difference in temperature. (You will learn more about heat transfer later in 
this chapter.) 


Thermal Equilibrium 


An important concept related to temperature is thermal equilibrium. Two 
objects are in thermal equilibrium if they are in close contact that allows 
either to gain energy from the other, but nevertheless, no net energy is 
transferred between them. Even when not in contact, they are in thermal 
equilibrium if, when they are placed in contact, no net energy is transferred 
between them. If two objects remain in contact for a long time, they 
typically come to equilibrium. In other words, two objects in thermal 
equilibrium do not exchange energy. 


Experimentally, if object A is in equilibrium with object B, and object B is 
in equilibrium with object C, then (as you may have already guessed) object 
A is in equilibrium with object C. That statement of transitivity is called the 
zeroth law of thermodynamics. (The number “zeroth” was suggested by 
British physicist Ralph Fowler in the 1930s. The first, second, and third 
laws of thermodynamics were already named and numbered then. The 
zeroth law had seldom been stated, but it needs to be discussed before the 
others, so Fowler gave it a smaller number.) Consider the case where A is a 
thermometer. The zeroth law tells us that if A reads a certain temperature 
when in equilibrium with B, and it is then placed in contact with C, it will 
not exchange energy with C; therefore, its temperature reading will remain 
the same ([link]). In other words, if two objects are in thermal equilibrium, 
they have the same temperature. 
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If thermometer A is in thermal equilibrium with object 
B, and B is in thermal equilibrium with C, then A is in 
thermal equilibrium with C. Therefore, the reading on 
A stays the same when A is moved over to make 
contact with C. 


A thermometer measures its own temperature. It is through the concepts of 
thermal equilibrium and the zeroth law of thermodynamics that we can say 
that a thermometer measures the temperature of something else, and to 
make sense of the statement that two objects are at the same temperature. 


In the rest of this chapter, we will often refer to “systems” instead of 
“objects.” As in the chapter on linear momentum and collisions, a system 
consists of one or more objects—but in thermodynamics, we require a 
system to be macroscopic, that is, to consist of a huge number (such as 10”° 
) of molecules. Then we can say that a system is in thermal equilibrium 
with itself if all parts of it are at the same temperature. (We will return to 
the definition of a thermodynamic system in the chapter on the first law of 
thermodynamics.) 


Summary 


e Temperature is operationally defined as the quantity measured by a 
thermometer. It is proportional to the average kinetic energy of atoms 
and molecules in a system. 

e Thermal equilibrium occurs when two bodies are in contact with each 
other and can freely exchange energy. Systems are in thermal 
equilibrium when they have the same temperature. 

e The zeroth law of thermodynamics states that when two systems, A 
and B, are in thermal equilibrium with each other, and B is in thermal 
equilibrium with a third system C, then A is also in thermal 
equilibrium with C. 


Conceptual Questions 


Exercise: 


Problem: 


What does it mean to say that two systems are in thermal equilibrium? 


Solution: 


They are at the same temperature, and if they are placed in contact, no 
net heat flows between them. 


Exercise: 


Problem: 


Give an example in which A has some kind of non-thermal equilibrium 
relationship with B, and B has the same relationship with C, but A does 
not have that relationship with C. 


Glossary 


heat transfer 
movement of energy from one place or material to another as a result 
of a difference in temperature 


temperature 
quantity measured by a thermometer, which reflects the mechanical 
energy of molecules in a system 


thermal equilibrium 
condition in which heat no longer flows between two objects that are 
in contact; the two objects have the same temperature 


zeroth law of thermodynamics 
law that states that if two objects are in thermal equilibrium, and a 
third object is in thermal equilibrium with one of those objects, it is 
also in thermal equilibrium with the other object 


Thermometers and Temperature Scales 
By the end of this section, you will be able to: 


e Describe several different types of thermometers 
e Convert temperatures between the Celsius, Fahrenheit, and Kelvin 
scales 


Any physical property that depends consistently and reproducibly on 
temperature can be used as the basis of a thermometer. For example, 
volume increases with temperature for most substances. This property is the 
basis for the common alcohol thermometer and the original mercury 
thermometers. Other properties used to measure temperature include 
electrical resistance, color, and the emission of infrared radiation ({link]). 
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(c) 


Because many physical properties depend on temperature, the variety 
of thermometers is remarkable. (a) In this common type of 
thermometer, the alcohol, containing a red dye, expands more rapidly 
than the glass encasing it. When the thermometer’s temperature 
increases, the liquid from the bulb is forced into the narrow tube, 
producing a large change in the length of the column for a small 


change in temperature. (b) Each of the six squares on this plastic 
(liquid crystal) thermometer contains a film of a different heat- 
sensitive liquid crystal material. Below 95 “F, all six squares are 
black. When the plastic thermometer is exposed to a temperature of 
95 °F, the first liquid crystal square changes color. When the 
temperature reaches above 96.8 °F, the second liquid crystal square 
also changes color, and so forth. (c) A firefighter uses a pyrometer to 
check the temperature of an aircraft carrier’s ventilation system. The 
pyrometer measures infrared radiation (whose emission varies with 
temperature) from the vent and quickly produces a temperature 
readout. Infrared thermometers are also frequently used to measure 
body temperature by gently placing them in the ear canal. Such 
thermometers are more accurate than the alcohol thermometers placed 
under the tongue or in the armpit. (credit b: modification of work by 
Tess Watson; credit c: modification of work by Lamel J. Hinton, U.S. 
Navy) 


Thermometers measure temperature according to well-defined scales of 
measurement. The three most common temperature scales are Fahrenheit, 
Celsius, and Kelvin. Temperature scales are created by identifying two 
reproducible temperatures. The freezing and boiling temperatures of water 
at standard atmospheric pressure are commonly used. 


On the Celsius scale, the freezing point of water is 0 °C and the boiling 
point is 100 °C. The unit of temperature on this scale is the degree Celsius 
(°C). The Fahrenheit scale (still the most frequently used for common 
purposes in the United States) has the freezing point of water at 32 °F and 
the boiling point at 212 °F. Its unit is the degree Fahrenheit (°F). You can 
see that 100 Celsius degrees span the same range as 180 Fahrenheit degrees. 
Thus, a temperature difference of one degree on the Celsius scale is 1.8 
times as large as a difference of one degree on the Fahrenheit scale, or 

ATp = 2 ATe. 


The definition of temperature in terms of molecular motion suggests that 
there should be a lowest possible temperature, where the average kinetic 


energy of molecules is zero (or the minimum allowed by quantum 
mechanics). Experiments confirm the existence of such a temperature, 
called absolute zero. An absolute temperature scale is one whose zero 
point is absolute zero. Such scales are convenient in science because several 
physical quantities, such as the volume of an ideal gas, are directly related 
to absolute temperature. 


The Kelvin scale is the absolute temperature scale that is commonly used in 
science. The SI temperature unit is the kelvin, which is abbreviated K (not 
accompanied by a degree sign). Thus 0 K is absolute zero. The freezing and 
boiling points of water are 273.15 K and 373.15 K, respectively. Therefore, 
temperature differences are the same in units of kelvins and degrees 
Celsius, or ATg = AT. 


The relationships between the three common temperature scales are shown 
in [link]. Temperatures on these scales can be converted using the equations 
in [link]. 


Freezing point Normal body Boiling point 
Absolute zero of water temperature of water 
‘en f -—s—_+ 
aS aa ttt 
—459.67 i 0 32 98.6 212 °F 
|/-——s°c >| 
AAA... —]$$ tp rast 
273.15 °C -178 0 37 100 °C 
255.25 310.15 |~———5 K+ 
te A$$} st tt 
0 K 273.15 373.15 K 


Relationships between the Fahrenheit, Celsius, and Kelvin temperature 
scales are shown. The relative sizes of the scales are also shown. 


To convert from... Use this equation... 


Celsius to Fahrenheit = 2T¢ +32 

Fahrenheit to Celsius le= 2.(Tp — 32) 

Celsius to Kelvin Tk = T¢ 4+ 273.15 

Kelvin to Celsius To = Tk — 273.15 
Fahrenheit to Kelvin Tk = (Tp — 32) + 273.15 
Kelvin to Fahrenheit Tr = (Tk — 273.15) + 32 


Temperature Conversions 


To convert between Fahrenheit and Kelvin, convert to Celsius as an 
intermediate step. 


Example: 

Converting between Temperature Scales: Room Temperature 

“Room temperature” is generally defined in physics to be 25 °C. (a) What 
is room temperature in “F? (b) What is it in K? 

Strategy 

To answer these questions, all we need to do is choose the correct 
conversion equations and substitute the known values. 

Solution 

To convert from °C to °F, use the equation 

Equation: 


9 


Substitute the known value into the equation and solve: 


Equation: 
9 ; : 
ji 5 (25 C)+32=77 °F. 


Similarly, we find that Tk = Tc + 273.15 = 298 K. 


The Kelvin scale is part of the SI system of units, so its actual definition is 
more complicated than the one given above. First, it is not defined in terms 
of the freezing and boiling points of water, but in terms of the triple point. 
The triple point is the unique combination of temperature and pressure at 
which ice, liquid water, and water vapor can coexist stably. As will be 
discussed in the section on phase changes, the coexistence is achieved by 
lowering the pressure and consequently the boiling point to reach the 
freezing point. The triple-point temperature is defined as 273.16 K. This 
definition has the advantage that although the freezing temperature and 
boiling temperature of water depend on pressure, there is only one triple- 
point temperature. 


Second, even with two points on the scale defined, different thermometers 
give somewhat different results for other temperatures. Therefore, a 
standard thermometer is required. Metrologists (experts in the science of 
measurement) have chosen the constant-volume gas thermometer for this 
purpose. A vessel of constant volume filled with gas is subjected to 
temperature changes, and the measured temperature is proportional to the 
change in pressure. Using “TP” to represent the triple point, 

Equation: 


pS fe. 
PTP 


The results depend somewhat on the choice of gas, but the less dense the 
gas in the bulb, the better the results for different gases agree. If the results 
are extrapolated to zero density, the results agree quite well, with zero 
pressure corresponding to a temperature of absolute zero. 


Constant-volume gas thermometers are big and come to equilibrium slowly, 
so they are used mostly as standards to calibrate other thermometers. 


Note: 
Visit this site to learn more about the constant-volume gas thermometer. 


Summary 


e Three types of thermometers are alcohol, liquid crystal, and infrared 
radiation (pyrometer). 

e The three main temperature scales are Celsius, Fahrenheit, and Kelvin. 
Temperatures can be converted from one scale to another using 
temperature Conversion equations. 

e The three phases of water (ice, liquid water, and water vapor) can 
coexist at a single pressure and temperature known as the triple point. 


Conceptual Questions 


Exercise: 
Problem: 
If a thermometer is allowed to come to equilibrium with the air, and a 
glass of water is not in equilibrium with the air, what will happen to 
the thermometer reading when it is placed in the water? 


Solution: 


The reading will change. 


Exercise: 


Problem: 
Give an example of a physical property that varies with temperature 
and describe how it is used to measure temperature. 

Problems 


Exercise: 
Problem: 
While traveling outside the United States, you feel sick. A companion 
gets you a thermometer, which says your temperature is 39. What scale 


is that on? What is your Fahrenheit temperature? Should you seek 
medical help? 


Solution: 


That must be Celsius. Your Fahrenheit temperature is 102 °F. Yes, it is 
time to get treatment. 


Exercise: 
Problem: What are the following temperatures on the Kelvin scale? 


(a) 68.0 °F, an indoor temperature sometimes recommended for 
energy conservation in winter 


(b) 134 °F, one of the highest atmospheric temperatures ever recorded 
on Earth (Death Valley, California, 1913) 


(c) 9890 °F, the temperature of the surface of the Sun 


Exercise: 


Problem: 


(a) Suppose a cold front blows into your locale and drops the 
temperature by 40.0 Fahrenheit degrees. How many degrees Celsius 
does the temperature decrease when it decreases by 40.0 “F? (b) Show 
that any change in temperature in Fahrenheit degrees is nine-fifths the 
change in Celsius degrees 


Solution: 


a. ATg = 22.2 °C; b. We know that ATp = Tro — Try. We also 
know that Tr. = 2To2 + 32 and Tr = 2To1 + 32. So, substituting, 


we have ATp = (2Tc2 ++ 32) — (2Tc1 + 32) Partially solving and 
rearranging the equation, we have ATp = 3 (Tc2 — Toi). Therefore, 
ATy = 2 AT. 

Exercise: 
Problem: 
An Associated Press article on climate change said, “Some of the ice 
shelf’s disappearance was probably during times when the planet was 
36 degrees Fahrenheit (2 degrees Celsius) to 37 degrees Fahrenheit (3 


degrees Celsius) warmer than it is today.” What mistake did the 
reporter make? 


Exercise: 
Problem: 
(a) At what temperature do the Fahrenheit and Celsius scales have the 


Same numerical value? (b) At what temperature do the Fahrenheit and 
Kelvin scales have the same numerical value? 


Solution: 


a. —40°; b. 575K 


Exercise: 


Problem: 


A person taking a reading of the temperature in a freezer in Celsius 
makes two mistakes: first omitting the negative sign and then thinking 
the temperature is Fahrenheit. That is, the person reads—ax “Cas x °F. 
Oddly enough, the result is the correct Fahrenheit temperature. What is 
the original Celsius reading? Round your answer to three significant 
figures. 


Glossary 


absolute temperature scale 
scale, such as Kelvin, with a zero point that is absolute zero 


absolute zero 
temperature at which the average kinetic energy of molecules is zero 


Celsius scale 
temperature scale in which the freezing point of water is 0 °C and the 
boiling point of water is 100 “C 


degree Celsius 
(°C) unit on the Celsius temperature scale 


degree Fahrenheit 
(°F) unit on the Fahrenheit temperature scale 


Fahrenheit scale 
temperature scale in which the freezing point of water is 32 °F and the 
boiling point of water is 212 °F 


Kelvin scale (K) 
temperature scale in which 0 K is the lowest possible temperature, 
representing absolute zero 


triple point 


pressure and temperature at which a substance exists in equilibrium as 
a solid, liquid, and gas 


Thermal Expansion 
By the end of this section, you will be able to: 


e Answer qualitative questions about the effects of thermal expansion 
e Solve problems involving thermal expansion, including those involving thermal stress 


The expansion of alcohol in a thermometer is one of many commonly encountered examples of thermal 
expansion, which is the change in size or volume of a given system as its temperature changes. The most 
visible example is the expansion of hot air. When air is heated, it expands and becomes less dense than 
the surrounding air, which then exerts an (upward) force on the hot air and makes steam and smoke rise, 
hot air balloons float, and so forth. The same behavior happens in all liquids and gases, driving natural 
heat transfer upward in homes, oceans, and weather systems, as we will discuss in an upcoming section. 
Solids also undergo thermal expansion. Railroad tracks and bridges, for example, have expansion joints to 
allow them to freely expand and contract with temperature changes, as shown in [link]. 


ia 
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(a) (b) 


(a) Thermal expansion joints like these in the (b) Auckland Harbour Bridge in New Zealand allow 
bridges to change length without buckling. (credit: modification of works by “SJi”/Wikimedia 
Commons) 


What is the underlying cause of thermal expansion? As previously mentioned, an increase in temperature 
means an increase in the kinetic energy of individual atoms. In a solid, unlike in a gas, the molecules are 
held in place by forces from neighboring molecules; as we saw in Oscillations, the forces can be modeled 
as in harmonic springs described by the Lennard-Jones potential. Energy in Simple Harmonic Motion 
shows that such potentials are asymmetrical in that the potential energy increases more steeply when the 
molecules get closer to each other than when they get farther away. Thus, at a given kinetic energy, the 
distance moved is greater when neighbors move away from each other than when they move toward each 
other. The result is that increased kinetic energy (increased temperature) increases the average distance 
between molecules—the substance expands. 


For most substances under ordinary conditions, it is an excellent approximation that there is no preferred 
direction (that is, the solid is “isotropic”), and an increase in temperature increases the solid’s size by a 


certain fraction in each dimension. Therefore, if the solid is free to expand or contract, its proportions stay 
the same; only its overall size changes. 


Note: 

Linear Thermal Expansion 

According to experiments, the dependence of thermal expansion on temperature, substance, and original 
length is summarized in the equation 

Equation: 


dL 
—=al 

dT 

where AL is the change in length L, AT is the change in temperature, and a is the coefficient of linear 
expansion, a material property that varies slightly with temperature. As a is nearly constant and also 
very small, for practical purposes, we use the linear approximation: 

Equation: 


AL = aLAT. 


[link] lists representative values of the coefficient of linear expansion. As noted earlier, AT is the same 
whether it is expressed in units of degrees Celsius or kelvins; thus, a may have units of 1/°C or 1/K with 
the same value in either case. Approximating a as a constant is quite accurate for small changes in 
temperature and sufficient for most practical purposes, even for large changes in temperature. We 
examine this approximation more closely in the next example. 


Coefficient of Linear Coefficient of Volume 

Material Expansion a (1/°C) Expansion 6 (1/°C) 
Solids 

Aluminum 25 x 1078 75 x 10°° 

Brass 19 x 10°° 56 x 10° 

Copper 17 x 10°° 51 x 10°° 

Gold 14 x 10°° 42 x 10°° 

Iron or steel 12 x 10° 35 x 10°° 


Invar (nickel-iron alloy) 0.9 x 10° 27 .% i" 


Coefficient of Linear Coefficient of Volume 


Material Expansion a (1/°C) Expansion 6 (1/°C) 
Lead 29 x 10°° 87 x 10° 
Silver 18 x 10°° 54 x 10-6 
Glass (ordinary) 9 x 10°° 27 «x 10°° 
Glass (Pyrex®) 3 x 10°° 9 x 10° 
Quartz 0.4 x 10-6 1 x 10-6 
Concrete, brick ~12 x 10-6 ~36 x 10-8 
Marble (average) 2.5 x 10-6 7.5 x 107° 
Liquids 

Ether 1650 x 10-° 
Ethyl alcohol 1100 x 10°° 
Gasoline 950 x 10-6 
Glycerin 500 x 10° 
Mercury 180 x 10°° 
Water 210 x 10-6 
Gases 


Air and most other gases at 3400 x 10-5 
atmospheric pressure 


Thermal Expansion Coefficients 


Thermal expansion is exploited in the bimetallic strip ([link]). This device can be used as a thermometer 
if the curving strip is attached to a pointer on a scale. It can also be used to automatically close or open a 
switch at a certain temperature, as in older or analog thermostats. 


i tf, 


(a) (b) 


The curvature of a 
bimetallic strip depends 
on temperature. (a) The 

strip is straight at the 
starting temperature, 
where its two 
components have the 
same length. (b) At a 
higher temperature, this 
strip bends to the right, 
because the metal on 
the left has expanded 
more than the metal on 
the right. At a lower 
temperature, the strip 
would bend to the left. 


Example: 

Calculating Linear Thermal Expansion 

The main span of San Francisco’s Golden Gate Bridge is 1275 m long at its coldest. The bridge is 
exposed to temperatures ranging from — 15 °C to 40 “C. What is its change in length between these 
temperatures? Assume that the bridge is made entirely of steel. 

Strategy 

Use the equation for linear thermal expansion AL = aLAT to calculate the change in length, AD. Use 
the coefficient of linear expansion a for steel from [link], and note that the change in temperature AT is 
55 °C. 

Solution 

Substitute all of the known values into the equation to solve for AL: 

Equation: 


12 x 10-6 


AL=a@laAr = 
aban = (=> 


) (2275m) (65 °C) = 0.84, 


Significance 
Although not large compared with the length of the bridge, this change in length is observable. It is 
generally spread over many expansion joints so that the expansion at each joint is small. 


Thermal Expansion in Two and Three Dimensions 


Unconstrained objects expand in all dimensions, as illustrated in [link]. That is, their areas and volumes, 
as well as their lengths, increase with temperature. Because the proportions stay the same, holes and 
container volumes also get larger with temperature. If you cut a hole in a metal plate, the remaining 
material will expand exactly as it would if the piece you removed were still in place. The piece would get 
bigger, so the hole must get bigger too. 


Note: 

Thermal Expansion in Two Dimensions 

For small temperature changes, the change in area AA is given by 
Equation: 


AA = 2a AAT 


where AA is the change in area A, AT is the change in temperature, and a is the coefficient of linear 
expansion, which varies slightly with temperature. (The derivation of this equation is analogous to that of 
the more important equation for three dimensions, below.) 


ee ea 


In general, objects expand in all directions as temperature increases. In these drawings, the original 
boundaries of the objects are shown with solid lines, and the expanded boundaries with dashed lines. 
(a) Area increases because both length and width increase. The area of a circular plug also increases. 

(b) If the plug is removed, the hole it leaves becomes larger with increasing temperature, just as if 
the expanding plug were still in place. (c) Volume also increases, because all three dimensions 
increase. 


Note: 

Thermal Expansion in Three Dimensions 

The relationship between volume and temperature a. is given by ae = BVAT, where £ is the 
coefficient of volume expansion. As you can show in [link], 6 = 3a. This equation is usually written as 
Equation: 


AV = BVAT. 


Note that the values of @ in [link] are equal to 3a except for rounding. 


Volume expansion is defined for liquids, but linear and area expansion are not, as a liquid’s changes in 
linear dimensions and area depend on the shape of its container. Thus, [link] shows liquids’ values of 8 
but not a. 


In general, objects expand with increasing temperature. Water is the most important exception to this rule. 
Water does expand with increasing temperature (its density decreases) at temperatures greater than 

4°C (40 °F). However, it is densest at +4 °C and expands with decreasing temperature between +4 °C 
and 0 °C (40 °F to 32 °F), as shown in [link]. A striking effect of this phenomenon is the freezing of 
water in a pond. When water near the surface cools down to 4 °C, it is denser than the remaining water 
and thus sinks to the bottom. This “turnover” leaves a layer of warmer water near the surface, which is 
then cooled. However, if the temperature in the surface layer drops below 4 °C, that water is less dense 
than the water below, and thus stays near the top. As a result, the pond surface can freeze over. The layer 
of ice insulates the liquid water below it from low air temperatures. Fish and other aquatic life can survive 
in 4 °C water beneath ice, due to this unusual characteristic of water. 
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This curve shows the density of water as a function of temperature. Note that the 
thermal expansion at low temperatures is very small. The maximum density at 4 °C 
is only 0.0075 % greater than the density at 2 °C, and 0.012 % greater than that at 
0 °C. The decrease of density below 4 °C occurs because the liquid water 


approachs the solid crystal form of ice, which contains more empty space than the 
liquid. 


Example: 

Calculating Thermal Expansion 

Suppose your 60.0-L (15.9 -gal-gal) steel gasoline tank is full of gas that is cool because it has just been 
pumped from an underground reservoir. Now, both the tank and the gasoline have a temperature of 

15.0 “C. How much gasoline has spilled by the time they warm to 35.0 “C? 

Strategy 

The tank and gasoline increase in volume, but the gasoline increases more, so the amount spilled is the 
difference in their volume changes. We can use the equation for volume expansion to calculate the 
change in volume of the gasoline and of the tank. (The gasoline tank can be treated as solid steel.) 
Solution 


1. Use the equation for volume expansion to calculate the increase in volume of the steel tank: 
Equation: 


AV; = BsVAT. 


2. The increase in volume of the gasoline is given by this equation: 
Equation: 


(aVars > BaisV eset 


3. Find the difference in volume to determine the amount spilled as 
Equation: 


Vepill = Vie a aE 


Alternatively, we can combine these three equations into a single equation. (Note that the original 
volumes are equal.) 


Equation: 
Vopill = Vere i B,)VAT 
= [(950 — 35) x 10°°/°C] (60.0 L) (20.0 °C) 
= 1.10L. 
Significance 


This amount is significant, particularly for a 60.0-L tank. The effect is so striking because the gasoline 
and steel expand quickly. The rate of change in thermal properties is discussed later in this chapter. 

If you try to cap the tank tightly to prevent overflow, you will find that it leaks anyway, either around the 
cap or by bursting the tank. Tightly constricting the expanding gas is equivalent to compressing it, and 
both liquids and solids resist compression with extremely large forces. To avoid rupturing rigid 
containers, these containers have air gaps, which allow them to expand and contract without stressing 
them. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Does a given reading on a gasoline gauge indicate more gasoline in 
cold weather or in hot weather, or does the temperature not matter? 


Solution: 


The actual amount (mass) of gasoline left in the tank when the gauge hits “empty” is less in the 
summer than in the winter. The gasoline has the same volume as it does in the winter when the “add 
fuel” light goes on, but because the gasoline has expanded, there is less mass. 


Thermal Stress 


If you change the temperature of an object while preventing it from expanding or contracting, the object 
is subjected to stress that is compressive if the object would expand in the absence of constraint and 
tensile if it would contract. This stress resulting from temperature changes is known as thermal stress. It 
can be quite large and can cause damage. 


To avoid this stress, engineers may design components so they can expand and contract freely. For 
instance, in highways, gaps are deliberately left between blocks to prevent thermal stress from 
developing. When no gaps can be left, engineers must consider thermal stress in their designs. Thus, the 
reinforcing rods in concrete are made of steel because steel’s coefficient of linear expansion is nearly 
equal to that of concrete. 


To calculate the thermal stress in a rod whose ends are both fixed rigidly, we can think of the stress as 
developing in two steps. First, let the ends be free to expand (or contract) and find the expansion (or 
contraction). Second, find the stress necessary to compress (or extend) the rod to its original length by the 
methods you studied in Static Equilibrium and Elasticity on static equilibrium and elasticity. In other 
words, the AL of the thermal expansion equals the AZ of the elastic distortion (except that the signs are 
opposite). 


Example: 

Calculating Thermal Stress 

Concrete blocks are laid out next to each other on a highway without any space between them, so they 
cannot expand. The construction crew did the work on a winter day when the temperature was 5 °C. 
Find the stress in the blocks on a hot summer day when the temperature is 38 °C. The compressive 
Young’s modulus of concrete is Y = 20 x 10°N/m’. 

Strategy 

According to the chapter on static equilibrium and elasticity, the stress F/A is given by 

Equation: 


where Y is the Young’s modulus of the material—concrete, in this case. In thermal expansion, 
AL = aL, AT. We combine these two equations by noting that the two AL/s are equal, as stated above. 


Because we are not given Lp or A, we can obtain a numerical answer only if they both cancel out. 
Solution 

We substitute the thermal-expansion equation into the elasticity equation to get 

Equation: 


F aLljAT 
A iff rae 
and as we hoped, Lg has canceled and A appears only in F/A, the notation for the quantity we are 
calculating. 
Now we need only insert the numbers: 
Equation: 


F 
a= (20 x 10° N/m’) (12 x 10-°/°C) (38°C —5 °C) =7.9 x 10°N/m’. 


Significance 
The ultimate compressive strength of concrete is 20 x 10°N i m’, so the blocks are unlikely to break. 
However, the ultimate shear strength of concrete is only 2 x 10°N/ m”, so some might chip off. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Two objects A and B have the same dimensions and are constrained 
identically. A is made of a material with a higher thermal expansion coefficient than B. If the objects 
are heated identically, will A feel a greater stress than B? 


Solution: 


Not necessarily, as the thermal stress is also proportional to Young’s modulus. 


Summary 


e Thermal expansion is the increase of the size (length, area, or volume) of a body due to a change in 
temperature, usually a rise. Thermal contraction is the decrease in size due to a change in 
temperature, usually a fall in temperature. 

e Thermal stress is created when thermal expansion or contraction is constrained. 


Conceptual Questions 


Exercise: 


Problem: 


Pouring cold water into hot glass or ceramic cookware can easily break it. What causes the breaking? 
Explain why Pyrex®, a glass with a small coefficient of linear expansion, is less susceptible. 


Solution: 


The cold water cools part of the inner surface, making it contract, while the rest remains expanded. 
The strain is too great for the strength of the material. Pyrex contracts less, so it experiences less 
strain. 


Exercise: 
Problem: 
One method of getting a tight fit, say of a metal peg in a hole in a metal block, is to manufacture the 


peg slightly larger than the hole. The peg is then inserted when at a different temperature than the 
block. Should the block be hotter or colder than the peg during insertion? Explain your answer. 


Exercise: 
Problem: 


Does it really help to run hot water over a tight metal lid on a glass jar before trying to open it? 
Explain your answer. 


Solution: 


In principle, the lid expands more than the jar because metals have higher coefficients of expansion 
than glass. That should make unscrewing the lid easier. (In practice, getting the lid and jar wet may 
make gripping them more difficult.) 


Exercise: 
Problem: 
When a cold alcohol thermometer is placed in a hot liquid, the column of alcohol goes down slightly 
before going up. Explain why. 

Exercise: 
Problem: 
Calculate the length of a 1-meter rod of a material with thermal expansion coefficient a when the 
temperature is raised from 300 K to 600 K. Taking your answer as the new initial length, find the 


length after the rod is cooled back down to 300 K. Is your answer 1 meter? Should it be? How can 
you account for the result you got? 


Solution: 


After being heated, the length is (1 + 300a) (1 m). After being cooled, the length is 

(1 — 300 a) (1 + 300 a) (1 m). That answer is not 1 m, but it should be. The explanation is that 
even if a is exactly constant, the relation AZ = aLAT is strictly true only in the limit of small AT. 
Since a values are small, the discrepancy is unimportant in practice. 


Exercise: 


Problem: 


Noting the large stresses that can be caused by thermal expansion, an amateur weapon inventor 
decides to use it to make a new kind of gun. He plans to jam a bullet against an aluminum rod inside 
a closed invar tube. When he heats the tube, the rod will expand more than the tube and a very strong 
force will build up. Then, by a method yet to be determined, he will open the tube in a split second 
and let the force of the rod launch the bullet at very high speed. What is he overlooking? 


Problems 


Exercise: 
Problem: 
The height of the Washington Monument is measured to be 170.00 m on a day when the temperature 
is 35.0 °C. What will its height be on a day when the temperature falls to —10.0 “C? Although the 


monument is made of limestone, assume that its coefficient of thermal expansion is the same as that 
of marble. Give your answer to five significant figures. 


Solution: 


Using [link] to find the coefficient of thermal expansion of marble: 
L=Io+AL = LIo(1+ aAT) =170m [1+ (2.5 x 10°°/°C) (—45.0 °C)] = 169.98 m. 
(Answer rounded to five significant figures to show the slight difference in height.) 


Exercise: 
Problem: 
How much taller does the Eiffel Tower become at the end of a day when the temperature has 
increased by 15 °C? Its original height is 321 m and you can assume it is made of steel. 

Exercise: 
Problem: 
What is the change in length of a 3.00-cm-long column of mercury if its temperature changes from 
37.0 °C to 40.0 °C, assuming the mercury is constrained to a cylinder but unconstrained in length? 


Your answer will show why thermometers contain bulbs at the bottom instead of simple columns of 
liquid. 


Solution: 


We use instead of a since this is a volume expansion with constant surface area. Therefore: 
AL = aLAT = (6.0 x 107°/°C) (0.0300 m) (3.00 °C) = 5.4 x 10°°m. 
Exercise: 
Problem: 
How large an expansion gap should be left between steel railroad rails if they may reach a maximum 
temperature 35.0 °C greater than when they were laid? Their original length is 10.0 m. 


Exercise: 


Problem: 


You are looking to buy a small piece of land in Hong Kong. The price is “only” $60,000 per square 
meter. The land title says the dimensions are 20m x 30m. By how much would the total price 
change if you measured the parcel with a steel tape measure on a day when the temperature was 

20 °C above the temperature that the tape measure was designed for? The dimensions of the land do 
not change. 


Solution: 


On the warmer day, our tape measure will expand linearly. Therefore, each measured dimension will 
be smaller than the actual dimension of the land. Calling these measured dimensions // and w/, we 
will find a new area, A. Let’s calculate these measured dimensions: 


U= ly — Al = (20m) — (20 °C) (20 m) (222g2* ) = 19.9952 m; 
Al=1 x wi= (29.9928 m) (19.9952 m) = 599.71 m?; 
Cost change = (A — Ar) ($8220) — ((600 — 599.71)m?) ( $825") — $17,000, 
Because the area gets smaller, the price of the land decreases by about $17,000. 

Exercise: 
Problem: 
Global warming will produce rising sea levels partly due to melting ice caps and partly due to the 
expansion of water as average ocean temperatures rise. To get some idea of the size of this effect, 
calculate the change in length of a column of water 1.00 km high for a temperature increase of 
1.00 *C. Assume the column is not free to expand sideways. As a model of the ocean, that is a 
reasonable approximation, as only parts of the ocean very close to the surface can expand sideways 


onto land, and only to a limited degree. As another approximation, neglect the fact that ocean 
warming is not uniform with depth. 


Exercise: 
Problem: 
(a) Suppose a meter stick made of steel and one made of aluminum are the same length at 0°C. 


What is their difference in length at 22.0 °C? (b) Repeat the calculation for two 30.0-m-long 
surveyor’s tapes. 


Solution: 


a. Use [link] to find the coefficients of thermal expansion of steel and aluminum. Then 
Niw~ Aled = (need AT = (ye - ot) (1.00 m)(22 °C) = 2.9 x 104m 


b. By the same method with Zp = 30.0 m, we have AL = 8.6 x 107? m. 
Exercise: 
Problem: 
(a) If a 500-mL glass beaker is filled to the brim with ethyl alcohol at a temperature of 5.00 °C, how 


much will overflow when the alcohol’s temperature reaches the room temperature of 22.0 °C? (b) 
How much less water would overflow under the same conditions? 


Exercise: 


Problem: 


Most cars have a coolant reservoir to catch radiator fluid that may overflow when the engine is hot. 
A radiator is made of copper and is filled to its 16.0-L capacity when at 10.0 “C. What volume of 
radiator fluid will overflow when the radiator and fluid reach a temperature of 95.0 °C, given that 
the fluid’s volume coefficient of expansion is 8 = 400 x 10~°/°C? (Your answer will be a 
conservative estimate, as most car radiators have operating temperatures greater than 95.0 °C). 


Solution: 


AV = 0.475 L 
Exercise: 
Problem: 
A physicist makes a cup of instant coffee and notices that, as the coffee cools, its level drops 3.00 
mm in the glass cup. Show that this decrease cannot be due to thermal contraction by calculating the 


decrease in level if the 350 cm? of coffee is in a 7.00-cm-diameter cup and decreases in temperature 
from 95.0 °C to 45.0 °C. (Most of the drop in level is actually due to escaping bubbles of air.) 


Exercise: 


Problem: 


The density of water at 0 °C is very nearly 1000 kg/ m° (it is actually 999.84 kg / m’), whereas the 


density of ice at 0 °C is 917 kg/ m’. Calculate the pressure necessary to keep ice from expanding 
when it freezes, neglecting the effect such a large pressure would have on the freezing temperature. 
(This problem gives you only an indication of how large the forces associated with freezing water 
might be.) 


Solution: 


If we start with the freezing of water, then it would expand to 


1000 kg/m* Damar 
(1 m’) (os) = 1.09m? = 1.98 x 10°N/m‘* of ice. 


Exercise: 
Problem: 


Show that 8 = 3a, by calculating the infinitesimal change in volume dV of a cube with sides of 
length L when the temperature changes by dT. 


Glossary 


coefficient of linear expansion 
(a) material property that gives the change in length, per unit length, per 1- °C change in 
temperature; a constant used in the calculation of linear expansion; the coefficient of linear 
expansion depends to some degree on the temperature of the material 


coefficient of volume expansion 
(8) similar to a but gives the change in volume, per unit volume, per 1-°C change in temperature 


thermal expansion 
change in size or volume of an object with change in temperature 


thermal stress 
stress caused by thermal expansion or contraction 


Heat Transfer, Specific Heat, and Calorimetry 
By the end of this section, you will be able to: 


e Explain phenomena involving heat as a form of energy transfer 
¢ Solve problems involving heat transfer 


We have seen in previous chapters that energy is one of the fundamental concepts of physics. Heat is a 
type of energy transfer that is caused by a temperature difference, and it can change the temperature of 
an object. As we learned earlier in this chapter, heat transfer is the movement of energy from one place 
or material to another as a result of a difference in temperature. Heat transfer is fundamental to such 
everyday activities as home heating and cooking, as well as many industrial processes. It also forms a 
basis for the topics in the remainder of this chapter. 


We also introduce the concept of internal energy, which can be increased or decreased by heat transfer. 
We discuss another way to change the internal energy of a system, namely doing work on it. Thus, we 
are beginning the study of the relationship of heat and work, which is the basis of engines and 
refrigerators and the central topic (and origin of the name) of thermodynamics. 


Internal Energy and Heat 


A thermal system has internal energy (also called thermal energy), which is the sum of the mechanical 
energies of its molecules. A system’s internal energy is proportional to its temperature. As we saw 
earlier in this chapter, if two objects at different temperatures are brought into contact with each other, 
energy is transferred from the hotter to the colder object until the bodies reach thermal equilibrium 
(that is, they are at the same temperature). No work is done by either object because no force acts 
through a distance (as we discussed in Work and Kinetic Energy). These observations reveal that heat 
is energy transferred spontaneously due to a temperature difference. [link] shows an example of heat 
transfer. 


(b) 


(a) Here, the soft drink has a higher temperature than the ice, so they are not in thermal 
equilibrium. (b) When the soft drink and ice are allowed to interact, heat is transferred from the 
drink to the ice due to the difference in temperatures until they reach the same temperature, 77, 

achieving equilibrium. In fact, since the soft drink and ice are both in contact with the 
surrounding air and the bench, the ultimate equilibrium temperature will be the same as that of 
the surroundings. 


The meaning of “heat” in physics is different from its ordinary meaning. For example, in conversation, 
we may Say “the heat was unbearable,” but in physics, we would say that the temperature was high. 
Heat is a form of energy flow, whereas temperature is not. Incidentally, humans are sensitive to heat 
flow rather than to temperature. 


Since heat is a form of energy, its SI unit is the joule (J). Another common unit of energy often used 
for heat is the calorie (cal), defined as the energy needed to change the temperature of 1.00 g of water 
by 1.00 *“C —specifically, between 14.5 “C and 15.5 °C, since there is a slight temperature 
dependence. Also commonly used is the kilocalorie (kcal), which is the energy needed to change the 
temperature of 1.00 kg of water by 1.00 °C. Since mass is most often specified in kilograms, the 
kilocalorie is convenient. Confusingly, food calories (sometimes called “big calories,” abbreviated 
Cal) are actually kilocalories, a fact not easily determined from package labeling. 


Mechanical Equivalent of Heat 


It is also possible to change the temperature of a substance by doing work, which transfers energy into 
or out of a system. This realization helped establish that heat is a form of energy. James Prescott Joule 
(1818-1889) performed many experiments to establish the mechanical equivalent of heat—the work 
needed to produce the same effects as heat transfer. In the units used for these two quantities, the 
value for this equivalence is 

Equation: 


1.000 kcal = 4186 J. 


We consider this equation to represent the conversion between two units of energy. (Other numbers 
that you may see refer to calories defined for temperature ranges other than 14.5 °C to 15.5 °C.) 


[link] shows one of Joule’s most famous experimental setups for demonstrating that work and heat can 
produce the same effects and measuring the mechanical equivalent of heat. It helped establish the 
principle of conservation of energy. Gravitational potential energy (U) was converted into kinetic 
energy (K), and then randomized by viscosity and turbulence into increased average kinetic energy of 
atoms and molecules in the system, producing a temperature increase. Joule’s contributions to 
thermodynamics were so significant that the SI unit of energy was named after him. 


Measured 
height of \— 
descent w 


at Insulated, known 
volume of water 


Joule’s experiment established the equivalence of heat and work. 
As the masses descended, they caused the paddles to do work, 
W = magh, on the water. The result was a temperature increase, 
AT, measured by the thermometer. Joule found that AT was 
proportional to W and thus determined the mechanical equivalent 
of heat. 


Increasing internal energy by heat transfer gives the same result as increasing it by doing work. 
Therefore, although a system has a well-defined internal energy, we cannot say that it has a certain 
“heat content” or “work content.” A well-defined quantity that depends only on the current state of the 
system, rather than on the history of that system, is known as a state variable. Temperature and 
internal energy are state variables. To sum up this paragraph, heat and work are not state variables. 


Incidentally, increasing the internal energy of a system does not necessarily increase its temperature. 
As we’ll see in the next section, the temperature does not change when a substance changes from one 
phase to another. An example is the melting of ice, which can be accomplished by adding heat or by 
doing frictional work, as when an ice cube is rubbed against a rough surface. 


Temperature Change and Heat Capacity 


We have noted that heat transfer often causes temperature change. Experiments show that with no 
phase change and no work done on or by the system, the transferred heat is typically directly 
proportional to the change in temperature and to the mass of the system, to a good approximation. 
(Below we show how to handle situations where the approximation is not valid.) The constant of 


proportionality depends on the substance and its phase, which may be gas, liquid, or solid. We omit 
discussion of the fourth phase, plasma, because although it is the most common phase in the universe, 
it is rare and short-lived on Earth. 


We can understand the experimental facts by noting that the transferred heat is the change in the 
internal energy, which is the total energy of the molecules. Under typical conditions, the total kinetic 
energy of the molecules Kita) is a constant fraction of the internal energy (for reasons and with 
exceptions that we’ll see in the next chapter). The average kinetic energy of a molecule K aye is 
proportional to the absolute temperature. Therefore, the change in internal energy of a system is 
typically proportional to the change in temperature and to the number of molecules, N. 
Mathematically, AU « AK total = NKave « NAT The dependence on the substance results in large 
part from the different masses of atoms and molecules. We are considering its heat capacity in terms of 
its mass, but as we will see in the next chapter, in some cases, heat capacities per molecule are similar 
for different substances. The dependence on substance and phase also results from differences in the 
potential energy associated with interactions between atoms and molecules. 


Note: 

Heat Transfer and Temperature Change 

A practical approximation for the relationship between heat transfer and temperature change is: 
Equation: 


Q =mcAT, 


where Q is the symbol for heat transfer (“quantity of heat”), m is the mass of the substance, and AT is 
the change in temperature. The symbol c stands for the specific heat (also called “specific heat 
capacity”) and depends on the material and phase. The specific heat is numerically equal to the 
amount of heat necessary to change the temperature of 1.00 kg of mass by 1.00 °C. The SI unit for 
specific heat is J/(kg x K) or J/(kg x °C). (Recall that the temperature change AT is the same 
in units of kelvin and degrees Celsius.) 


Values of specific heat must generally be measured, because there is no simple way to calculate them 
precisely. [link] lists representative values of specific heat for various substances. We see from this 
table that the specific heat of water is five times that of glass and 10 times that of iron, which means 
that it takes five times as much heat to raise the temperature of water a given amount as for glass, and 
10 times as much as for iron. In fact, water has one of the largest specific heats of any material, which 
is important for sustaining life on Earth. 


The specific heats of gases depend on what is maintained constant during the heating—typically either 
the volume or the pressure. In the table, the first specific heat value for each gas is measured at 
constant volume, and the second (in parentheses) is measured at constant pressure. We will return to 
this topic in the chapter on the kinetic theory of gases. 


Substances Specific Heat (c) 


Solids J/kg- °C kcal/kg - °C" 
Aluminum 900 0.215 
Asbestos 800 0.19 
Concrete, granite (average) 840 0.20 

Copper 387 0.0924 

Glass 840 0.20 

Gold 129 0.0308 
Human body (average at 37 °C) 3500 0.83 

Ice (average, —50 “C to0 °C) 2090 0.50 

Iron, steel 452 0.108 

Lead 128 0.0305 
Silver 235 0.0562 
Wood 1700 0.40 

Liquids 

Benzene 1740 0.415 
Ethanol 2450 0.586 
Glycerin 2410 0.576 
Mercury 139 0.0333 
Water (15.0 °C) 4186 1.000 
Gases!3! 

Air (dry) 721 (1015) 0.172 (0.242) 
Ammonia 1670 (2190) 0.399 (0.523) 
Carbon dioxide 638 (833) 0.152 (0.199) 


Nitrogen 739 (1040) 0.177 (0.248) 


Substances Specific Heat (c) 
Oxygen 651 (913) 0.156 (0.218) 
Steam (100 °C) 1520 (2020) 0.363 (0.482) 


Specific Heats of Various Substances[1][1]The values for solids and liquids are at constant volume and 
25 °C, except as noted. [2/These values are identical in units of cal/g - °C. [3]Specific heats at 
constant volume and at 20.0 °C except as noted, and at 1.00 atm pressure. Values in parentheses are 
specific heats at a constant pressure of 1.00 atm. 


In general, specific heat also depends on temperature. Thus, a precise definition of c for a substance 


must be given in terms of an infinitesimal change in temperature. To do this, we note that c = — oe 
and replace A with d: 
Equation: 
1 dQ 
c= — —. 
m dT 


Except for gases, the temperature and volume dependence of the specific heat of most substances is 
weak at normal temperatures. Therefore, we will generally take specific heats to be constant at the 
values given in the table. 


Example: 

Calculating the Required Heat 

A 0.500-kg aluminum pan on a stove and 0.250 L of water in it are heated from 20.0 °C to 80.0 °C. 
(a) How much heat is required? What percentage of the heat is used to raise the temperature of (b) the 
pan and (c) the water? 

Strategy 

We can assume that the pan and the water are always at the same temperature. When you put the pan 
on the stove, the temperature of the water and that of the pan are increased by the same amount. We 
use the equation for the heat transfer for the given temperature change and mass of water and 
aluminum. The specific heat values for water and aluminum are given in [link]. 

Solution 


1. Calculate the temperature difference: 
Equation: 


AL =1;—1, =60.0 C, 


2. Calculate the mass of water. Because the density of water is 1000 kg/ m®, 1 L of water has a 
mass of 1 kg, and the mass of 0.250 L of water is my, = 0.250 kg. 

3. Calculate the heat transferred to the water. Use the specific heat of water in [link]: 
Equation: 


Qw = MyCyAT = (0.250 kg) (4186 J/kg °C) (60.0 °C) = 62.8 kJ. 


4. Calculate the heat transferred to the aluminum. Use the specific heat for aluminum in [link]: 


Equation: 
Qai = Macq AT = (0.500 kg) (900 J/kg *C) (60.0 °C) = 27.0 kJ. 


5. Find the total transferred heat: 
Equation: 


Qtotal = Qw + Qa = 89.8 kJ. 


Significance 

In this example, the heat transferred to the container is a significant fraction of the total transferred 
heat. Although the mass of the pan is twice that of the water, the specific heat of water is over four 
times that of aluminum. Therefore, it takes a bit more than twice as much heat to achieve the given 
temperature change for the water as for the aluminum pan. 


[link] illustrates a temperature rise caused by doing work. (The result is the same as if the same 
amount of energy had been added with a blowtorch instead of mechanically.) 


Example: 

Calculating the Temperature Increase from the Work Done on a Substance 

Truck brakes used to control speed on a downhill run do work, converting gravitational potential 
energy into increased internal energy (higher temperature) of the brake material ([link]). This 
conversion prevents the gravitational potential energy from being converted into kinetic energy of the 
truck. Since the mass of the truck is much greater than that of the brake material absorbing the energy, 
the temperature increase may occur too fast for sufficient heat to transfer from the brakes to the 
environment; in other words, the brakes may overheat. 


The smoking brakes on a braking truck are visible evidence of the mechanical equivalent of heat. 


Calculate the temperature increase of 10 kg of brake material with an average specific heat of 

800 J/kg - °C if the material retains 10% of the energy from a 10,000-kg truck descending 75.0 m (in 
vertical displacement) at a constant speed. 

Strategy 

We calculate the gravitational potential energy (Mgh) that the entire truck loses in its descent, equate 
it to the increase in the brakes’ internal energy, and then find the temperature increase produced in the 
brake material alone. 

Solution 

First we calculate the change in gravitational potential energy as the truck goes downhill: 

Equation: 


Mgh = (10,000 kg) (9.80 m/s”) (75.0m) = 7.35 x 10°J. 


Because the kinetic energy of the truck does not change, conservation of energy tells us the lost 
potential energy is dissipated, and we assume that 10% of it is transferred to internal energy of the 
brakes, so take Q = Mgh/10. Then we calculate the temperature change from the heat transferred, 
using 

Equation: 


where m is the mass of the brake material. Insert the given values to find 
Equation: 


7.35 x 10°J 


— — 92 "Ce 
(10 kg) (800 J/kg °C) 


Significance 

If the truck had been traveling for some time, then just before the descent, the brake temperature 
would probably be higher than the ambient temperature. The temperature increase in the descent 
would likely raise the temperature of the brake material very high, so this technique is not practical. 
Instead, the truck would use the technique of engine braking. A different idea underlies the recent 
technology of hybrid and electric cars, where mechanical energy (kinetic and gravitational potential 
energy) is converted by the brakes into electrical energy in the battery, a process called regenerative 
braking. 


In a common kind of problem, objects at different temperatures are placed in contact with each other 
but isolated from everything else, and they are allowed to come into equilibrium. A container that 
prevents heat transfer in or out is called a calorimeter, and the use of a calorimeter to make 
measurements (typically of heat or specific heat capacity) is called calorimetry. 


We will use the term “calorimetry problem” to refer to any problem in which the objects concerned are 
thermally isolated from their surroundings. An important idea in solving calorimetry problems is that 
during a heat transfer between objects isolated from their surroundings, the heat gained by the colder 
object must equal the heat lost by the hotter object, due to conservation of energy: 


Note: 
Equation: 


Qeoid + Dror = 0. 


We express this idea by writing that the sum of the heats equals zero because the heat gained is usually 
considered positive; the heat lost, negative. 


Example: 

Calculating the Final Temperature in Calorimetry 

Suppose you pour 0.250 kg of 20.0-°C water (about a cup) into a 0.500-kg aluminum pan off the 
stove with a temperature of 150 °C. Assume no heat transfer takes place to anything else: The pan is 
placed on an insulated pad, and heat transfer to the air is neglected in the short time needed to reach 
equilibrium. Thus, this is a calorimetry problem, even though no isolating container is specified. Also 
assume that a negligible amount of water boils off. What is the temperature when the water and pan 
reach thermal equilibrium? 

Strategy 

Originally, the pan and water are not in thermal equilibrium: The pan is at a higher temperature than 
the water. Heat transfer restores thermal equilibrium once the water and pan are in contact; it stops 
once thermal equilibrium between the pan and the water is achieved. The heat lost by the pan is equal 
to the heat gained by the water—that is the basic principle of calorimetry. 

Solution 


1. Use the equation for heat transfer Q = mcAT to express the heat lost by the aluminum pan in 
terms of the mass of the pan, the specific heat of aluminum, the initial temperature of the pan, 
and the final temperature: 

Equation: 


Qhot = Maicai (Tt — 150 °C). 


2. Express the heat gained by the water in terms of the mass of the water, the specific heat of water, 
the initial temperature of the water, and the final temperature: 
Equation: 


Qeold = Mwew (T; — 20.0 “C). 


3. Note that Qnot < 0 and Qoiq > 0 and that as stated above, they must sum to zero: 
Equation: 


Occlae ee = 0 


Qeold = Sie 
MyCw (Ts — 20.0°C) = —mayca, (T? — 150 °C). 


4. Bring all terms involving T¢ on the left hand side and all other terms on the right hand side. 
Solving for T', 
Equation: 


7. = MAICAL (150 °C) + mycw (20.0 °C) 
Maicar + MyCw ; 


and insert the numerical values: 
Equation: 
(0.500 kg) (900 J/kg “C) (150 °C) + (0.250 kg) (4186 J/kg °C) (20.0 °C) 


= 59.1°C. 
(0.500 kg) (900 J/kg °C) + (0.250 kg) (4186 J/kg °C) 


jhe 


Significance 

Why is the final temperature so much closer to 20.0 °C than to 150 °C? The reason is that water has 
a greater specific heat than most common substances and thus undergoes a smaller temperature 
change for a given heat transfer. A large body of water, such as a lake, requires a large amount of heat 
to increase its temperature appreciably. This explains why the temperature of a lake stays relatively 
constant during the day even when the temperature change of the air is large. However, the water 
temperature does change over longer times (e.g., summer to winter). 


Note: 
Exercise: 


Problem: 


Check Your Understanding If 25 kJ is necessary to raise the temperature of a rock from 
25 °C to 30 °C, how much heat is necessary to heat the rock from 45 °C to 50 °C? 


Solution: 


To a good approximation, the heat transfer depends only on the temperature difference. Since the 
temperature differences are the same in both cases, the same 25 kJ is necessary in the second 
case. (As we will see in the next section, the answer would have been different if the object had 
been made of some substance that changes phase anywhere between 30 °C and 50 °C.) 


Example: 

Temperature-Dependent Heat Capacity 

At low temperatures, the specific heats of solids are typically proportional to T°. The first 
understanding of this behavior was due to the Dutch physicist Peter Debye, who in 1912, treated 
atomic oscillations with the quantum theory that Max Planck had recently used for radiation. For 


instance, a good approximation for the specific heat of salt, NaCl, is c = 3.33 x ee (ee 
The constant 321 K is called the Debye temperature of NaCl, Op, and the formula works well when 
T < 0.040p. Using this formula, how much heat is required to raise the temperature of 24.0 g of 
NaCl from 5 K to 15 K? 

Solution 

Because the heat capacity depends on the temperature, we need to use the equation 

Equation: 


m dT 
T> 
We solve this equation for Q by integrating both sides: Q = m ed?’ 
T; 

Then we substitute the given values in and evaluate the integral: 
Equation: 

T2 5 aT, T 3 i J ' 15K 

= (0.024k 333 x 10 dT = | 6.04 x 10 “— ]T = ole J 

re ae rea (a CMe ie) ag 
Significance 


If we had used the equation Q = mcAT and the room-temperature specific heat of salt, 
880 J/kg - K, we would have gotten a very different value. 


Summary 
¢ Heat and work are the two distinct methods of energy transfer. 


e Heat transfer to an object when its temperature changes is often approximated well by 
Q = mcAT, where m is the object’s mass and c is the specific heat of the substance. 


Conceptual Questions 


Exercise: 


Problem: How is heat transfer related to temperature? 


Solution: 
Temperature differences cause heat transfer. 


Exercise: 


Problem: Describe a situation in which heat transfer occurs. 


Exercise: 


Problem: When heat transfers into a system, is the energy stored as heat? Explain briefly. 


Solution: 


No, it is stored as thermal energy. A thermodynamic system does not have a well-defined quantity 
of heat. 


Exercise: 


Problem: 


The brakes in a car increase in temperature by AT’ when bringing the car to rest from a speed v. 
How much greater would AT be if the car initially had twice the speed? You may assume the car 
stops fast enough that no heat transfers out of the brakes. 


Problems 


Exercise: 
Problem: 
On a hot day, the temperature of an 80,000-L swimming pool increases by 1.50 “C. What is the 


net heat transfer during this heating? Ignore any complications, such as loss of water by 
evaporation. 


Solution: 


i= 5.20 x 108) 
Exercise: 
Problem: 
To sterilize a 50.0-g glass baby bottle, we must raise its temperature from 22.0 °C to 95.0 °C. 
How much heat transfer is required? 
Exercise: 
Problem: 
The same heat transfer into identical masses of different substances produces different 


temperature changes. Calculate the final temperature when 1.00 kcal of heat transfers into 1.00 kg 
of the following, originally at 20.0 °C: (a) water; (b) concrete; (c) steel; and (d) mercury. 


Solution: 


Q=mcAT => AT = &; a. 21.0 °C;b. 25.0 °C; c. 29.3 °C; d. 50.0 °C 


Exercise: 


Problem: 


Rubbing your hands together warms them by converting work into thermal energy. If a woman 
rubs her hands back and forth for a total of 20 rubs, at a distance of 7.50 cm per rub, and with an 
average frictional force of 40.0 N, what is the temperature increase? The mass of tissues warmed 
is only 0.100 kg, mostly in the palms and fingers. 


Exercise: 
Problem: 
A 0.250-kg block of a pure material is heated from 20.0 °C to 65.0 °C by the addition of 4.35 kJ 


of energy. Calculate its specific heat and identify the substance of which it is most likely 
composed. 


Solution: 


Q=mcAT Se= — = UO = 0.0924 kcal/kg - °C. It is copper. 
Exercise: 


Problem: 


Suppose identical amounts of heat transfer into different masses of copper and water, causing 
identical changes in temperature. What is the ratio of the mass of copper to water? 


Exercise: 


Problem: 


(a) The number of kilocalories in food is determined by calorimetry techniques in which the food 
is burned and the amount of heat transfer is measured. How many kilocalories per gram are there 
in a 5.00-g peanut if the energy from burning it is transferred to 0.500 kg of water held in a 0.100- 
kg aluminum cup, causing a 54.9-°C temperature increase? Assume the process takes place in an 
ideal calorimeter, in other words a perfectly insulated container. (b) Compare your answer to the 
following labeling information found on a package of dry roasted peanuts: a serving of 33 g 
contains 200 calories. Comment on whether the values are consistent. 


Solution: 


a. Q = MyCyAT + mayeqi AT = (Mycy + Ma1Ca1) AT; 
(0.500 kg) (1.00 kcal/kg - °C)-+ ; 
= 54.9 °C) = 28.63 kcal; 
2 = | (0.100 kg) (0.215 kcal/kg - °C) | ' ) - 


—— ee = 5.74 keal/e: b, 2 = acta = 6 kcal/g, which is consistent with our 


Mp 


results to part (a), to one significant figure. 
Exercise: 


Problem: 


Following vigorous exercise, the body temperature of an 80.0 kg person is 40.0 °C. At what rate 
in watts must the person transfer thermal energy to reduce the body temperature to 37.0 °C in 
30.0 min, assuming the body continues to produce energy at the rate of 150 W? 

(1 watt = 1 joule/second or 1 W = 1 J/s) 


Exercise: 


Problem: 


In a study of healthy young men[footnote], doing 20 push-ups in 1 minute burned an amount of 
energy per kg that for a 70.0-kg man corresponds to 8.06 calories (kcal). How much would a 
70.0-kg man’s temperature rise if he did not lose any heat during that time? 

JW Vezina, “An examination of the differences between two methods of estimating energy 
expenditure in resistance training activities,” Journal of Strength and Conditioning Research, 
April 28, 2014, http://www.ncbi.nlm.nih.gov/pubmed/24402448 


Solution: 


0.139 °C 

Exercise: 
Problem: 
A 1.28-kg sample of water at 10.0 °C is in a calorimeter. You drop a piece of steel with a mass of 
0.385 kg at 215 °C into it. After the sizzling subsides, what is the final equilibrium temperature? 
(Make the reasonable assumptions that any steam produced condenses into liquid water during 


the process of equilibration and that the evaporation and condensation don’t affect the outcome, 
as we’ll see in the next section.) 


Exercise: 
Problem: 
Repeat the preceding problem, assuming the water is in a glass beaker with a mass of 0.200 kg, 
which in turn is in a calorimeter. The beaker is initially at the same temperature as the water. 
Before doing the problem, should the answer be higher or lower than the preceding answer? 


Comparing the mass and specific heat of the beaker to those of the water, do you think the beaker 
will make much difference? 


Solution: 


It should be lower. The beaker will not make much difference: 16.3 °C 


Glossary 


calorie (cal) 
energy needed to change the temperature of 1.00 g of water by 1.00 °C 


calorimeter 
container that prevents heat transfer in or out 


calorimetry 
study of heat transfer inside a container impervious to heat 


heat 
energy transferred solely due to a temperature difference 


kilocalorie (kcal) 
energy needed to change the temperature of 1.00 kg of water between 14.5 °C and 15.5 °C 


mechanical equivalent of heat 
work needed to produce the same effects as heat transfer 


specific heat 
amount of heat necessary to change the temperature of 1.00 kg of a substance by 1.00 °C; also 
called “specific heat capacity” 


Phase Changes 
By the end of this section, you will be able to: 


e Describe phase transitions and equilibrium between phases 
¢ Solve problems involving latent heat 
¢ Solve calorimetry problems involving phase changes 


Phase transitions play an important theoretical and practical role in the study of heat flow. In melting (or 
“fusion”), a solid turns into a liquid; the opposite process is freezing. In evaporation, a liquid turns into a gas; the 
opposite process is condensation. 


A substance melts or freezes at a temperature called its melting point, and boils (evaporates rapidly) or 
condenses at its boiling point. These temperatures depend on pressure. High pressure favors the denser form, so 
typically, high pressure raises the melting point and boiling point, and low pressure lowers them. For example, 
the boiling point of water is 100 °C at 1.00 atm. At higher pressure, the boiling point is higher, and at lower 
pressure, it is lower. The main exception is the melting and freezing of water, discussed in the next section. 


Phase Diagrams 


The phase of a given substance depends on the pressure and temperature. Thus, plots of pressure versus 
temperature showing the phase in each region provide considerable insight into thermal properties of substances. 
Such a pT graph is called a phase diagram. 


[link] shows the phase diagram for water. Using the graph, if you know the pressure and temperature, you can 
determine the phase of water. The solid curves—boundaries between phases—indicate phase transitions, that is, 
temperatures and pressures at which the phases coexist. For example, the boiling point of water is 100 °C at 
1.00 atm. As the pressure increases, the boiling temperature rises gradually to 374 °C at a pressure of 218 atm. 
A pressure cooker (or even a covered pot) cooks food faster than an open pot, because the water can exist as a 
liquid at temperatures greater than 100 °C without all boiling away. (As we’ll see in the next section, liquid 
water conducts heat better than steam or hot air.) The boiling point curve ends at a certain point called the 
critical point—that is, a critical temperature, above which the liquid and gas phases cannot be distinguished; 
the substance is called a supercritical fluid. At sufficiently high pressure above the critical point, the gas has the 
density of a liquid but does not condense. Carbon dioxide, for example, is supercritical at all temperatures above 
31.0 °C. Critical pressure is the pressure of the critical point. 


= 
o 
= 
ea 


Boiling 


Pressure, p (atm) 


Sublimation 


0 0.01 100 374 
Temperature, T (°C) 


The phase diagram (pT graph) for water shows solid 
(s), liquid (1), and vapor (v) phases. At temperatures 
and pressure above those of the critical point, there is 
no distinction between liquid and vapor. Note that the 
axes are nonlinear and the graph is not to scale. This 
graph is simplified—it omits several exotic phases of 
ice at higher pressures. The phase diagram of water is 
unusual because the melting-point curve has a negative 
slope, showing that you can melt ice by increasing the 
pressure. 


Similarly, the curve between the solid and liquid regions in [link] gives the melting temperature at various 
pressures. For example, the melting point is 0 °C at 1.00 atm, as expected. Water has the unusual property that 
ice is less dense than liquid water at the melting point, so at a fixed temperature, you can change the phase from 
solid (ice) to liquid (water) by increasing the pressure. That is, the melting temperature of ice falls with 
increased pressure, as the phase diagram shows. For example, when a car is driven over snow, the increased 
pressure from the tires melts the snowflakes; afterwards, the water refreezes and forms an ice layer. 


As you learned in the earlier section on thermometers and temperature scales, the triple point is the combination 
of temperature and pressure at which ice, liquid water, and water vapor can coexist stably—that is, all three 
phases exist in equilibrium. For water, the triple point occurs at 273.16 K (0.01 °C) and 611.2 Pa; that is a 
more accurate calibration temperature than the melting point of water at 1.00 atm, or 273.15 K (0.0 °C). 


Note: 
View this video to see a substance at its triple point. 


At pressures below that of the triple point, there is no liquid phase; the substance can exist as either gas or solid. 
For water, there is no liquid phase at pressures below 0.00600 atm. The phase change from solid to gas is called 


sublimation. You may have noticed that snow can disappear into thin air without a trace of liquid water, or that 
ice cubes can disappear in a freezer. Both are examples of sublimation. The reverse also happens: Frost can form 
on very cold windows without going through the liquid stage. [link] shows the result, as well as showing a 
familiar example of sublimation. Carbon dioxide has no liquid phase at atmospheric pressure. Solid COz is 
known as dry ice because instead of melting, it sublimes. Its sublimation temperature at atmospheric pressure is 
—78 °C. Certain air fresheners use the sublimation of a solid to spread a perfume around a room. Some solids, 
such as osmium tetroxide, are so toxic that they must be kept in sealed containers to prevent human exposure to 
their sublimation-produced vapors. 


(b) 


Direct transitions between solid and vapor are common, sometimes useful, and even beautiful. (a) Dry ice 
sublimes directly to carbon dioxide gas. The visible “smoke” consists of water droplets that condensed in 
the air cooled by the dry ice. (b) Frost forms patterns on a very cold window, an example of a solid formed 
directly from a vapor. (credit a: modification of work by Windell Oskay; credit b: modification of work by 
Liz West) 


Equilibrium 


At the melting temperature, the solid and liquid phases are in equilibrium. If heat is added, some of the solid will 
melt, and if heat is removed, some of the liquid will freeze. The situation is somewhat more complex for liquid- 
gas equilibrium. Generally, liquid and gas are in equilibrium at any temperature. We call the gas phase a vapor 
when it exists at a temperature below the boiling temperature, as it does for water at 20.0 °C. Liquid in a closed 
container at a fixed temperature evaporates until the pressure of the gas reaches a certain value, called the vapor 
pressure, which depends on the gas and the temperature. At this equilibrium, if heat is added, some of the liquid 
will evaporate, and if heat is removed, some of the gas will condense; molecules either join the liquid or form 
suspended droplets. If there is not enough liquid for the gas to reach the vapor pressure in the container, all the 
liquid eventually evaporates. 


If the vapor pressure of the liquid is greater than the total ambient pressure, including that of any air (or other 
gas), the liquid evaporates rapidly; in other words, it boils. Thus, the boiling point of a liquid at a given pressure 
is the temperature at which its vapor pressure equals the ambient pressure. Liquid and gas phases are in 
equilibrium at the boiling temperature ([link]). If a substance is in a closed container at the boiling point, then 
the liquid is boiling and the gas is condensing at the same rate without net change in their amounts. 


Vaporization 
Condensation 


Vaporization 
Condensation 


(a) (b) 


Equilibrium between liquid and gas at two different boiling points inside a closed 
container. (a) The rates of boiling and condensation are equal at this combination 
of temperature and pressure, so the liquid and gas phases are in equilibrium. (b) At 
a higher temperature, the boiling rate is faster, that is, the rate at which molecules 
leave the liquid and enter the gas is faster. This increases the number of molecules 
in the gas, which increases the gas pressure, which in turn increases the rate at 
which gas molecules condense and enter the liquid. The pressure stops increasing 
when it reaches the point where the boiling rate and the condensation rate are 
equal. The gas and liquid are in equilibrium again at this higher temperature and 
pressure. 


For water, 100 °C is the boiling point at 1.00 atm, so water and steam should exist in equilibrium under these 
conditions. Why does an open pot of water at 100 “C boil completely away? The gas surrounding an open pot is 
not pure water: it is mixed with air. If pure water and steam are in a closed container at 100 °C and 1.00 atm, 
they will coexist—but with air over the pot, there are fewer water molecules to condense, and water boils away. 
Another way to see this is that at the boiling point, the vapor pressure equals the ambient pressure. However, 
part of the ambient pressure is due to air, so the pressure of the steam is less than the vapor pressure at that 
temperature, and evaporation continues. Incidentally, the equilibrium vapor pressure of solids is not zero, a fact 
that accounts for sublimation. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Explain why a cup of water (or soda) with ice cubes stays at 0 °C, even ona 
hot summer day. 


Solution: 


The ice and liquid water are in thermal equilibrium, so that the temperature stays at the freezing 
temperature as long as ice remains in the liquid. (Once all of the ice melts, the water temperature will start 
to rise.) 


Phase Change and Latent Heat 


So far, we have discussed heat transfers that cause temperature change. However, in a phase transition, heat 
transfer does not cause any temperature change. 


For an example of phase changes, consider the addition of heat to a sample of ice at —20 °C ({link]) and 
atmospheric pressure. The temperature of the ice rises linearly, absorbing heat at a constant rate of 

2090 J/kg - °C until it reaches 0 °C. Once at this temperature, the ice begins to melt and continues until it has 
all melted, absorbing 333 kJ/kg of heat. The temperature remains constant at 0 °C during this phase change. 
Once all the ice has melted, the temperature of the liquid water rises, absorbing heat at a new constant rate of 
4186 J/kg - °C. At 100 °C, the water begins to boil. The temperature again remains constant during this phase 
change while the water absorbs 2256 kJ/kg of heat and turns into steam. When all the liquid has become steam, 
the temperature rises again, absorbing heat at a rate of 2020 J/kg - °C. If we started with steam and cooled it to 
make it condense into liquid water and freeze into ice, the process would exactly reverse, with the temperature 
again constant during each phase transition. 


Water + Steam 


AQim (kJ/kg) 


Temperature versus heat. The system is constructed so that no vapor evaporates while ice warms 
to become liquid water, and so that, when vaporization occurs, the vapor remains in the system. 
The long stretches of constant temperatures at 0 “C and 100 °C reflect the large amounts of heat 
needed to cause melting and vaporization, respectively. 


Where does the heat added during melting or boiling go, considering that the temperature does not change until 
the transition is complete? Energy is required to melt a solid, because the attractive forces between the 
molecules in the solid must be broken apart, so that in the liquid, the molecules can move around at comparable 
kinetic energies; thus, there is no rise in temperature. Energy is needed to vaporize a liquid for similar reasons. 
Conversely, work is done by attractive forces when molecules are brought together during freezing and 
condensation. That energy must be transferred out of the system, usually in the form of heat, to allow the 
molecules to stay together ([link]). Thus, condensation occurs in association with cold objects—the glass in 
[link], for example. 


Condensation forms on this glass of iced 
tea because the temperature of the nearby 
air is reduced. The air cannot hold as much 
water as it did at room temperature, so 
water condenses. Energy is released when 
the water condenses, speeding the melting 
of the ice in the glass. (credit: Jenny 
Downing) 


The energy released when a liquid freezes is used by orange growers when the temperature approaches 0 °C. 
Growers spray water on the trees so that the water freezes and heat is released to the growing oranges. This 
prevents the temperature inside the orange from dropping below freezing, which would damage the fruit ([link]). 


The ice on these trees released large amounts of energy 

when it froze, helping to prevent the temperature of the 

trees from dropping below 0 °C. Water is intentionally 

sprayed on orchards to help prevent hard frosts. (credit: 
Hermann Hammer) 


The energy involved in a phase change depends on the number of bonds or force pairs and their strength. The 
number of bonds is proportional to the number of molecules and thus to the mass of the sample. The energy per 
unit mass required to change a substance from the solid phase to the liquid phase, or released when the substance 
changes from liquid to solid, is known as the heat of fusion. The energy per unit mass required to change a 
substance from the liquid phase to the vapor phase is known as the heat of vaporization. The strength of the 
forces depends on the type of molecules. The heat Q absorbed or released in a phase change in a sample of mass 
m is given by 


Note: 
Equation: 


Q = mL;(melting /freezing) 


Note: 
Equation: 


Q = mL,(vaporization /condensation) 


where the latent heat of fusion L¢ and latent heat of vaporization Ly are material constants that are determined 
experimentally. (Latent heats are also called latent heat coefficients and heats of transformation.) These 
constants are “latent,” or hidden, because in phase changes, energy enters or leaves a system without causing a 
temperature change in the system, so in effect, the energy is hidden. 


Limits of motion 


Solid 


Liquid 


Energy 


input ea 


Evaporate 


Condense 


Energy eg 


output 


(b) 


Gas 


(a) Energy is required to partially overcome the attractive forces (modeled as springs) between molecules in 
a solid to form a liquid. That same energy must be removed from the liquid for freezing to take place. (b) 
Molecules become separated by large distances when going from liquid to vapor, requiring significant 
energy to completely overcome molecular attraction. The same energy must be removed from the vapor for 
condensation to take place. 


[link] lists representative values of Lr and L, in kJ/kg, together with melting and boiling points. Note that in 


general, L, > L¢. The table shows that the amounts of energy involved in phase changes can easily be 


comparable to or greater than those involved in temperature changes, as [link] and the accompanying discussion 


also showed. 


Substance 


Helium!?! 
Hydrogen 
Nitrogen 
Oxygen 
Ethanol 
Ammonia 
Mercury 


Water 


Melting Point 


(°C) 

—272.2 (0.95 K) 
—259.3 (13.9 K) 
—210.0 (63.2 K) 
—218.8 (54.4K) 


—114 


Ly 


kJ/kg 


kcal/kg 


Boiling Point 
(°C) 


—268.9 (4.2 K) 
—252.9 (20.2 K) 
—195.8 (7.4K) 


—183.0 (90.2 K) 


357 


100.0 


Ly 


kJ/kg 


20.9 
452 
201 
213 
854 

1370 
272 


2256131 


kcal/kg 


4.99 
108 


48.0 


Sulfur 119 38.1 9.10 444.6 326 77.9 
Lead 327 24.5 5.85 1750 871 208 
Antimony 631 165 39.4 1440 561 134 
Aluminum 660 380 90 2450 11400 2720 
Silver 961 88.3 21.1 2193 2336 558 
Gold 1063 64.5 15.4 2660 1578 377 
Copper 1083 134 32.0 2595 5069 1211 
Uranium 1133 84 20 3900 1900 454 
Tungsten 3410 184 44 5900 4810 1150 


Heats of Fusion and Vaporization[1][1]Values quoted at the normal melting and boiling temperatures at standard 
atmospheric pressure (1 atm). [2]Helium has no solid phase at atmospheric pressure. The melting point given is at 
a pressure of 2.5 MPa. [3]At 37.0 °C (body temperature), the heat of vaporization Ly for water is 2430 kJ/kg or 
580 kcal/kg. [4]At 37.0 °C (body temperature), the heat of vaporization, L, for water is 2430 kJ/kg or 580 
kcal/kg. 


Phase changes can have a strong stabilizing effect on temperatures that are not near the melting and boiling 
points, since evaporation and condensation occur even at temperatures below the boiling point. For example, air 
temperatures in humid climates rarely go above approximately 38.0 °C because most heat transfer goes into 
evaporating water into the air. Similarly, temperatures in humid weather rarely fall below the dew point—the 
temperature where condensation occurs given the concentration of water vapor in the air—because so much heat 
is released when water vapor condenses. 


More energy is required to evaporate water below the boiling point than at the boiling point, because the kinetic 
energy of water molecules at temperatures below 100 °C is less than that at 100 °C, so less energy is available 
from random thermal motions. For example, at body temperature, evaporation of sweat from the skin requires a 
heat input of 2428 kJ/kg, which is about 10% higher than the latent heat of vaporization at 100 °C. This heat 
comes from the skin, and this evaporative cooling effect of sweating helps reduce the body temperature in hot 
weather. However, high humidity inhibits evaporation, so that body temperature might rise, while unevaporated 
sweat might be left on your brow. 


Example: 

Calculating Final Temperature from Phase Change 

Three ice cubes are used to chill a soda at 20 “C with mass mgoqga, = 0.25 kg. The ice is at 0 °C and each ice 
cube has a mass of 6.0 g. Assume that the soda is kept in a foam container so that heat loss can be ignored and 
that the soda has the same specific heat as water. Find the final temperature when all ice has melted. 
Strategy 

The ice cubes are at the melting temperature of 0 °C. Heat is transferred from the soda to the ice for melting. 
Melting yields water at 0 °C, so more heat is transferred from the soda to this water until the water plus soda 
system reaches thermal equilibrium. 

The heat transferred to the ice is 

Equation: 


hes = Micelt aE MiceCw (Ty a 0 ely, 


The heat given off by the soda is 


Equation: 

Qsoda = MsodaCw (Ty — 20 eye 
Since no heat is lost, Qice = —Qsoda, as in [link], so that 
Equation: 


Miceli + MiceCw (Ty —0 *C) = —™MsodaCw (Ty — 20 “); 


Solve for the unknown quantity TJ}: 
Equation: 
MsodaCw (20 Oo) — MiceL¢ 


a = 
(Msoda + Miele, 


Solution 

First we identify the known quantities. The mass of ice is Mice = 3 x 6.0 g = 0.018 kg and the mass of soda 
iS Msoda = 0.25 kg. Then we calculate the final temperature: 

Equation: 


_ 20,930 J — 6012 J 


= = 13°C. 
: 1122J/°C 


Significance 

This example illustrates the large energies involved during a phase change. The mass of ice is about 7% of the 
mass of the soda but leads to a noticeable change in the temperature of the soda. Although we assumed that the 
ice was at the freezing temperature, this is unrealistic for ice straight out of a freezer: The typical temperature is 
—6 °C. However, this correction makes no significant change from the result we found. Can you explain why? 


Like solid-liquid and liquid-vapor transitions, direct solid-vapor transitions or sublimations involve heat. The 
energy transferred is given by the equation Q = mL,, where L, is the heat of sublimation, analogous to Lr and 
L,. The heat of sublimation at a given temperature is equal to the heat of fusion plus the heat of vaporization at 
that temperature. 


We can now calculate any number of effects related to temperature and phase change. In each case, it is 
necessary to identify which temperature and phase changes are taking place. Keep in mind that heat transfer and 
work can cause both temperature and phase changes. 


Note: 
Problem-Solving Strategy: The Effects of Heat Transfer 


1. Examine the situation to determine that there is a change in the temperature or phase. Is there heat transfer 
into or out of the system? When it is not obvious whether a phase change occurs or not, you may wish to 
first solve the problem as if there were no phase changes, and examine the temperature change obtained. If 
it is sufficient to take you past a boiling or melting point, you should then go back and do the problem in 
steps—temperature change, phase change, subsequent temperature change, and so on. 

2. Identify and list all objects that change temperature or phase. 

3. Identify exactly what needs to be determined in the problem (identify the unknowns). A written list is 
useful. 


4. Make a list of what is given or what can be inferred from the problem as stated (identify the knowns). If 
there is a temperature change, the transferred heat depends on the specific heat of the substance (Heat 
Transfer, Specific Heat, and Calorimetry), and if there is a phase change, the transferred heat depends on 
the latent heat of the substance ((link]). 

. Solve the appropriate equation for the quantity to be determined (the unknown). 

6. Substitute the knowns along with their units into the appropriate equation and obtain numerical solutions 

complete with units. You may need to do this in steps if there is more than one state to the process, such as 
a temperature change followed by a phase change. However, in a calorimetry problem, each step 
corresponds to a term in the single equation Qpot + Qeola = 0. 

7. Check the answer to see if it is reasonable. Does it make sense? As an example, be certain that any 

temperature change does not also cause a phase change that you have not taken into account. 


uo 


Note: 
Exercise: 


Problem: 


Check Your Understanding Why does snow often remain even when daytime temperatures are higher 
than the freezing temperature? 


Solution: 
Snow is formed from ice crystals and thus is the solid phase of water. Because enormous heat is necessary 


for phase changes, it takes a certain amount of time for this heat to be transferred from the air, even if the 
air is above 0 °C. 


Summary 


¢ Most substances have three distinct phases (under ordinary conditions on Earth), and they depend on 
temperature and pressure. 

e Two phases coexist (i.e., they are in thermal equilibrium) at a set of pressures and temperatures. 

e Phase changes occur at fixed temperatures for a given substance at a given pressure, and these temperatures 
are called boiling, freezing (or melting), and sublimation points. 


Conceptual Questions 


Exercise: 


Problem: 


A pressure cooker contains water and steam in equilibrium at a pressure greater than atmospheric pressure. 
How does this greater pressure increase cooking speed? 


Solution: 


It raises the boiling point, so the water, which the food gains heat from, is at a higher temperature. 


Exercise: 


Problem: 


As shown below, which is the phase diagram for carbon dioxide, what is the vapor pressure of solid carbon 
dioxide (dry ice) at —78.5 “C? (Note that the axes in the figure are nonlinear and the graph is not to scale.) 
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Exercise: 
Problem: 


Can carbon dioxide be liquefied at room temperature (20 °C)? If so, how? If not, why not? (See the phase 
diagram in the preceding problem.) 


Solution: 
Yes, by raising the pressure above 56 atm. 


Exercise: 


Problem: What is the distinction between gas and vapor? 


Exercise: 
Problem: Heat transfer can cause temperature and phase changes. What else can cause these changes? 


Solution: 


work 
Exercise: 
Problem: 
How does the latent heat of fusion of water help slow the decrease of air temperatures, perhaps preventing 
temperatures from falling significantly below 0 °C, in the vicinity of large bodies of water? 


Exercise: 


Problem: What is the temperature of ice right after it is formed by freezing water? 
Solution: 


0 °C (at or near atmospheric pressure) 
Exercise: 
Problem: 
If you place 0 °C ice into 0 °C water in an insulated container, what will the net result be? Will there be 
less ice and more liquid water, or more ice and less liquid water, or will the amounts stay the same? 
Exercise: 
Problem: 


What effect does condensation on a glass of ice water have on the rate at which the ice melts? Will the 
condensation speed up the melting process or slow it down? 


Solution: 


Condensation releases heat, so it speeds up the melting. 
Exercise: 
Problem: 
In Miami, Florida, which has a very humid climate and numerous bodies of water nearby, it is unusual for 
temperatures to rise above about 38 °C (100 °F). In the desert climate of Phoenix, Arizona, however, 


temperatures rise above that almost every day in July and August. Explain how the evaporation of water 
helps limit high temperatures in humid climates. 


Exercise: 
Problem: 
In winter, it is often warmer in San Francisco than in Sacramento, 150 km inland. In summer, it is nearly 


always hotter in Sacramento. Explain how the bodies of water surrounding San Francisco moderate its 
extreme temperatures. 


Solution: 


Because of water’s high specific heat, it changes temperature less than land. Also, evaporation reduces 
temperature rises. The air tends to stay close to equilibrium with the water, so its temperature does not 
change much where there’s a lot of water around, as in San Francisco but not Sacramento. 


Exercise: 
Problem: 
Freeze-dried foods have been dehydrated in a vacuum. During the process, the food freezes and must be 


heated to facilitate dehydration. Explain both how the vacuum speeds up dehydration and why the food 
freezes as a result. 


Exercise: 


Problem: 


In a physics classroom demonstration, an instructor inflates a balloon by mouth and then cools it in liquid 
nitrogen. When cold, the shrunken balloon has a small amount of light blue liquid in it, as well as some 
snow-like crystals. As it warms up, the liquid boils, and part of the crystals sublime, with some crystals 
lingering for a while and then producing a liquid. Identify the blue liquid and the two solids in the cold 
balloon. Justify your identifications using data from [Link]. 


Solution: 


The liquid is oxygen, whose boiling point is above that of nitrogen but whose melting point is below the 
boiling point of liquid nitrogen. The crystals that sublime are carbon dioxide, which has no liquid phase at 
atmospheric pressure. The crystals that melt are water, whose melting point is above carbon dioxide’s 
sublimation point. The water came from the instructor’s breath. 


Problems 


Exercise: 
Problem: 
How much heat transfer (in kilocalories) is required to thaw a 0.450-kg package of frozen vegetables 
originally at 0 °C if their heat of fusion is the same as that of water? 

Exercise: 
Problem: 
A bag containing 0 °C ice is much more effective in absorbing energy than one containing the same 
amount of 0 °C water. (a) How much heat transfer is necessary to raise the temperature of 0.800 kg of 
water from 0 °C to 30.0 °C? (b) How much heat transfer is required to first melt 0.800 kg of 0 °C ice and 


then raise its temperature? (c) Explain how your answer supports the contention that the ice is more 
effective. 


Solution: 


a.1.00 x 10°J;b.3.68 x 10°J; c. The ice is much more effective in absorbing heat because it first must 
be melted, which requires a lot of energy, and then it gains the same amount of heat as the bag that started 
with water. The first 2.67 x 10° J of heat is used to melt the ice, then it absorbs the 1.00 x 10°J of heat 
as water. 


Exercise: 
Problem: 
(a) How much heat transfer is required to raise the temperature of a 0.750-kg aluminum pot containing 2.50 


kg of water from 30.0 °C to the boiling point and then boil away 0.750 kg of water? (b) How long does this 
take if the rate of heat transfer is 500 W? 


Exercise: 
Problem: 
Condensation on a glass of ice water causes the ice to melt faster than it would otherwise. If 8.00 g of vapor 
condense on a glass containing both water and 200 g of ice, how many grams of the ice will melt as a 


result? Assume no other heat transfer occurs. Use L, for water at 37 °C as a better approximation than L, 
for water at 100 °C.) 


Solution: 


58.1 g 
Exercise: 
Problem: 
On a trip, you notice that a 3.50-kg bag of ice lasts an average of one day in your cooler. What is the 


average power in watts entering the ice if it starts at 0 °C and completely melts to 0 °C water in exactly 
one day? 


Exercise: 
Problem: 
On a certain dry sunny day, a swimming pool’s temperature would rise by 1.50 °C if not for evaporation. 


What fraction of the water must evaporate to carry away precisely enough energy to keep the temperature 
constant? 


Solution: 


Let M be the mass of pool water and m be the mass of pool water that evaporates. 


= —  cAT _ (1.00 kcal/kg-°C)(1.50°C) 3. 
McAT = mLy 37 °c) => aT = Dae = “ 580keal/kg = 2.59 x 10 > 
(Note that Ly for water at 37 °C is used here as a better approximation than Ly for 100 °C water.) 
Exercise: 
Problem: 


(a) How much heat transfer is necessary to raise the temperature of a 0.200-kg piece of ice from —20.0 °C 
to 130.0 °C, including the energy needed for phase changes? (b) How much time is required for each stage, 
assuming a constant 20.0 kJ/s rate of heat transfer? (c) Make a graph of temperature versus time for this 
process. 


Exercise: 
Problem: 
In 1986, an enormous iceberg broke away from the Ross Ice Shelf in Antarctica. It was an approximately 
rectangular prism 160 km long, 40.0 km wide, and 250 m thick. (a) What is the mass of this iceberg, given 


that the density of ice is 917 kg/ m?? (b) How much heat transfer (in joules) is needed to melt it? (c) How 


many years would it take sunlight alone to melt ice this thick, if the ice absorbs an average of 100 W/ m?, 
12.00 h per day? 


Solution: 


a. 1.47 x 10%kg;b.4.90 x 107°J;c. 48.5 y 

Exercise: 
Problem: 
How many grams of coffee must evaporate from 350 g of coffee in a 100-g glass cup to cool the coffee and 
the cup from 95.0 °C to 45.0 °C? Assume the coffee has the same thermal properties as water and that the 
average heat of vaporization is 2340 kJ/kg (560 kcal/g). Neglect heat losses through processes other than 


evaporation, as well as the change in mass of the coffee as it cools. Do the latter two assumptions cause 
your answer to be higher or lower than the true answer? 


Exercise: 


Problem: 


(a) It is difficult to extinguish a fire on a crude oil tanker, because each liter of crude oil releases 

2.80 x 10° J of energy when burned. To illustrate this difficulty, calculate the number of liters of water 
that must be expended to absorb the energy released by burning 1.00 L of crude oil, if the water’s 
temperature rises from 20.0 °C to 100 °C, it boils, and the resulting steam’s temperature rises to 300 °C at 
constant pressure. (b) Discuss additional complications caused by the fact that crude oil is less dense than 
water. 


Solution: 


a. 9.35 L; b. Crude oil is less dense than water, so it floats on top of the water, thereby exposing it to the 
oxygen in the air, which it uses to burn. Also, if the water is under the oil, it is less able to absorb the heat 
generated by the oil. 


Exercise: 
Problem: 
The energy released from condensation in thunderstorms can be very large. Calculate the energy released 


into the atmosphere for a small storm of radius 1 km, assuming that 1.0 cm of rain is precipitated uniformly 
over this area. 


Exercise: 
Problem: 
To help prevent frost damage, 4.00 kg of water at 0 °C is sprayed onto a fruit tree. (a) How much heat 
transfer occurs as the water freezes? (b) How much would the temperature of the 200-kg tree decrease if 


this amount of heat transferred from the tree? Take the specific heat to be 3.35 kJ/kg - °C, and assume that 
no phase change occurs in the tree. 


Solution: 


a. 319 kcal; b. 2.00 °C 
Exercise: 
Problem: 
A 0.250-kg aluminum bowl holding 0.800 kg of soup at 25.0 °C is placed in a freezer. What is the final 


temperature if 388 kJ of energy is transferred from the bowl and soup, assuming the soup’s thermal 
properties are the same as that of water? 


Exercise: 
Problem: 


A 0.0500-kg ice cube at —30.0 °C is placed in 0.400 kg of 35.0-°C water in a very well-insulated 
container. What is the final temperature? 


Solution: 


First bring the ice up to 0 °C and melt it with heat Q, : 4.74 kcal. This lowers the temperature of water by 
AT» : 23.15 °C. Now, the heat lost by the hot water equals that gained by the cold water (7; is the final 
temperature): 20.6 °C 


Exercise: 


Problem: 


If you pour 0.0100 kg of 20.0 °C water onto a 1.20-kg block of ice (which is initially at —15.0 °C), what 
is the final temperature? You may assume that the water cools so rapidly that effects of the surroundings are 
negligible. 

Exercise: 
Problem: 
Indigenous people sometimes cook in watertight baskets by placing hot rocks into water to bring it to a 
boil. What mass of 500-°C granite must be placed in 4.00 kg of 15.0-°C water to bring its temperature to 


100 °C, if 0.0250 kg of water escapes as vapor from the initial sizzle? You may neglect the effects of the 
surroundings. 


Solution: 


Let the subscripts r, e, v, and w represent rock, equilibrium, vapor, and water, respectively. 
myc; (T; — Te) = myLy + mwew (Te — T2); 


myLy+mwyew( Te = T») 


mm, = 
c,(Ti—Te) 
__ (0.0250 kg) (2256 x 10° J/kg) +(3.975 kg) (4186 x 10° J/kg-°C) (100 “C—15 °C) 
~~ (840 J/kg: °C)(500 *C—100 °C) 
= 4.38 kg 
Exercise: 
Problem: 


What would the final temperature of the pan and water be in [link] if 0.260 kg of water were placed in the 
pan and 0.0100 kg of the water evaporated immediately, leaving the remainder to come to a common 
temperature with the pan? 


Glossary 


critical point 
for a given substance, the combination of temperature and pressure above which the liquid and gas phases 
are indistinguishable 


critical pressure 
pressure at the critical point 


critical temperature 
temperature at the critical point 


heat of fusion 
energy per unit mass required to change a substance from the solid phase to the liquid phase, or released 
when the substance changes from liquid to solid 


heat of sublimation 
energy per unit mass required to change a substance from the solid phase to the vapor phase 


heat of vaporization 
energy per unit mass required to change a substance from the liquid phase to the vapor phase 


latent heat coefficient 
general term for the heats of fusion, vaporization, and sublimation 


phase diagram 
graph of pressure vs. temperature of a particular substance, showing at which pressures and temperatures 
the phases of the substance occur 


sublimation 
phase change from solid to gas 


vapor 
gas at a temperature below the boiling temperature 


vapor pressure 
pressure at which a gas coexists with its solid or liquid phase 


Mechanisms of Heat Transfer 
By the end of this section, you will be able to: 


e Explain some phenomena that involve conductive, convective, and radiative heat 
transfer 

¢ Solve problems on the relationships between heat transfer, time, and rate of heat 
transfer 

¢ Solve problems using the formulas for conduction and radiation 


Just as interesting as the effects of heat transfer on a system are the methods by which it 
occurs. Whenever there is a temperature difference, heat transfer occurs. It may occur 
rapidly, as through a cooking pan, or slowly, as through the walls of a picnic ice chest. 
So many processes involve heat transfer that it is hard to imagine a situation where no 
heat transfer occurs. Yet every heat transfer takes place by only three methods: 


1. Conduction is heat transfer through stationary matter by physical contact. (The 
matter is stationary on a macroscopic scale—we know that thermal motion of the 
atoms and molecules occurs at any temperature above absolute zero.) Heat 
transferred from the burner of a stove through the bottom of a pan to food in the 
pan is transferred by conduction. 

2. Convection is the heat transfer by the macroscopic movement of a fluid. This type 
of transfer takes place in a forced-air furnace and in weather systems, for example. 

3. Heat transfer by radiation occurs when microwaves, infrared radiation, visible 
light, or another form of electromagnetic radiation is emitted or absorbed. An 
obvious example is the warming of Earth by the Sun. A less obvious example is 
thermal radiation from the human body. 


In the illustration at the beginning of this chapter, the fire warms the snowshoers’ faces 
largely by radiation. Convection carries some heat to them, but most of the air flow from 
the fire is upward (creating the familiar shape of flames), carrying heat to the food being 
cooked and into the sky. The snowshoers wear clothes designed with low conductivity 
to prevent heat flow out of their bodies. 


In this section, we examine these methods in some detail. Each method has unique and 
interesting characteristics, but all three have two things in common: They transfer heat 
solely because of a temperature difference, and the greater the temperature difference, 
the faster the heat transfer ([link]). 


Convection 
around windows 
and doors 

(cold air) 


Convection (hot air) 


In a fireplace, heat transfer occurs by all three methods: 
conduction, convection, and radiation. Radiation is responsible 
for most of the heat transferred into the room. Heat transfer also 
occurs through conduction into the room, but much slower. Heat 
transfer by convection also occurs through cold air entering the 
room around windows and hot air leaving the room by rising up 
the chimney. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Name an example from daily life (different from the 
text) for each mechanism of heat transfer. 


Solution: 


Conduction: Heat transfers into your hands as you hold a hot cup of coffee. 
Convection: Heat transfers as the barista “steams” cold milk to make hot cocoa. 
Radiation: Heat transfers from the Sun to a jar of water with tea leaves in it to 
make “Sun tea.” A great many other answers are possible. 


Conduction 


As you walk barefoot across the living room carpet in a cold house and then step onto 
the kitchen tile floor, your feet feel colder on the tile. This result is intriguing, since the 
carpet and tile floor are both at the same temperature. The different sensation is 
explained by the different rates of heat transfer: The heat loss is faster for skin in contact 
with the tiles than with the carpet, so the sensation of cold is more intense. 


Some materials conduct thermal energy faster than others. [link] shows a material that 
conducts heat slowly—it is a good thermal insulator, or poor heat conductor—used to 
reduce heat flow into and out of a house. 


Insulation is used to limit the 
conduction of heat from the inside to the 
outside (in winter) and from the outside 
to the inside (in summer). (credit: Giles 

Douglas) 


A molecular picture of heat conduction will help justify the equation that describes it. 
[link] shows molecules in two bodies at different temperatures, 7}, and T., for “hot” and 
“cold.” The average kinetic energy of a molecule in the hot body is higher than in the 
colder body. If two molecules collide, energy transfers from the high-energy to the low- 
energy molecule. In a metal, the picture would also include free valence electrons 
colliding with each other and with atoms, likewise transferring energy. The cumulative 
effect of all collisions is a net flux of heat from the hotter body to the colder body. Thus, 
the rate of heat transfer increases with increasing temperature difference 

AT = T;, — T.. If the temperatures are the same, the net heat transfer rate is zero. 


Because the number of collisions increases with increasing area, heat conduction is 
proportional to the cross-sectional area—a second factor in the equation. 


Surface 
Th Low energy Fe 
SS - before collision 

Higher ‘SK Lower 
temperature AT \ temperature 
High energy A, 

before collision 

Heat 
conduction 


Molecules in two bodies at different temperatures 
have different average kinetic energies. Collisions 
occurring at the contact surface tend to transfer 
energy from high-temperature regions to low- 
temperature regions. In this illustration, a molecule 
in the lower-temperature region (right side) has low 
energy before collision, but its energy increases after 
colliding with a high-energy molecule at the contact 
surface. In contrast, a molecule in the higher- 
temperature region (left side) has high energy before 
collision, but its energy decreases after colliding 
with a low-energy molecule at the contact surface. 


A third quantity that affects the conduction rate is the thickness of the material through 
which heat transfers. [link] shows a slab of material with a higher temperature on the 
left than on the right. Heat transfers from the left to the right by a series of molecular 
collisions. The greater the distance between hot and cold, the more time the material 
takes to transfer the same amount of heat. 


Material having 
thermal conductivity k 


AreaA 


tuo Te 


Heat conduction occurs through any material, represented here by a 
rectangular bar, whether window glass or walrus blubber. 


All four of these quantities appear in a simple equation deduced from and confirmed by 
experiments. The rate of conductive heat transfer through a slab of material, such as 
the one in [link], is given by 


Note: 
Equation: 


p— 9 _ EAC — os) 
dt d 


where P is the power or rate of heat transfer in watts or in kilocalories per second, A and 
d are its surface area and thickness, as shown in [link], 7}, — T is the temperature 
difference across the slab, and k is the thermal conductivity of the material. [link] gives 
representative values of thermal conductivity. 


More generally, we can write 
Equation: 


pana, 
dx 


where x is the coordinate in the direction of heat flow. Since in [link], the power and 
area are constant, dT/dx is constant, and the temperature decreases linearly from 7}, to 
fis . 


Substance Thermal Conductivity k (W/m - °C) 
Diamond 2000 
Silver A420 
Copper 390 

Gold 318 
Aluminum 220 

Steel iron 80 

Steel (stainless) 14 

Ice 2.2 

Glass (average) 0.84 
Concrete brick 0.84 
Water 0.6 

Fatty tissue (without blood) 0.2 
Asbestos 0.16 
Plasterboard 0.16 
Wood 0.08—0.16 


Snow (dry) 0.10 


Substance Thermal Conductivity k (W/m - °C) 


Cork 0.042 
Glass wool 0.042 
Wool 0.04 

Down feathers 0.025 
Air 0.023 
Polystyrene foam 0.010 


Thermal Conductivities of Common Substances Values are given for temperatures near 
0°C. 


Example: 

Calculating Heat Transfer through Conduction 

A polystyrene foam icebox has a total area of 0.950 m? and walls with an average 
thickness of 2.50 cm. The box contains ice, water, and canned beverages at 0 °C. The 
inside of the box is kept cold by melting ice. How much ice melts in one day if the 
icebox is kept in the trunk of a car at 35.0 °C? 

Strategy 

This question involves both heat for a phase change (melting of ice) and the transfer of 
heat by conduction. To find the amount of ice melted, we must find the net heat 
transferred. This value can be obtained by calculating the rate of heat transfer by 
conduction and multiplying by time. 

Solution 

First we identify the knowns. 

k = 0.010 W/m - °C for polystyrene foam; A = 0.950 m?; 

d= "2 00cm 0.0250 mie a0 Osi Bon ©: 

t = 1 day = 24 hours - 86,400 s. 

Then we identify the unknowns. We need to solve for the mass of the ice, m. We also 
need to solve for the net heat transferred to melt the ice, Q. The rate of heat transfer by 
conduction is given by 

Equation: 


p_ 2@ _ A(T -T) 


dt d 


The heat used to melt the ice is Q = mL¢.We insert the known values: 
Equation: 


. (0.010 W/m- °C) (0.950 m?”) (35.0 °C — 0 °C) 
7 0.0250 m 


= 13.3 W. 


Multiplying the rate of heat transfer by the time we obtain 
Equation: 


Q = Pt = (13.3 W) (86.400s) = 1.15 x 10°J. 


We set this equal to the heat transferred to melt the ice, Q = mI +z, and solve for the 
mass m: 
Equation: 


6 
a Q _ iS. x 10" J Shes 
I¢ 334 x 10° J/kg 
Significance 
The result of 3.44 kg, or about 7.6 lb, seems about right, based on experience. You 
might expect to use about a 4 kg (7-10 Ib) bag of ice per day. A little extra ice is 
required if you add any warm food or beverages. 
[link] shows that polystyrene foam is a very poor conductor and thus a good insulator. 
Other good insulators include fiberglass, wool, and goosedown feathers. Like 
polystyrene foam, these all contain many small pockets of air, taking advantage of air’s 
poor thermal conductivity. 


In developing insulation, the smaller the conductivity k and the larger the thickness d, 
the better. Thus, the ratio d/k, called the R factor, is large for a good insulator. The rate 
of conductive heat transfer is inversely proportional to R. R factors are most commonly 
quoted for household insulation, refrigerators, and the like. Unfortunately, in the United 
States, R is still in non-metric units of ft? - °F - h /Btu, although the unit usually goes 
unstated [1 British thermal unit (Btu) is the amount of energy needed to change the 
temperature of 1.0 lb of water by 1.0 °F, which is 1055.1 J]. A couple of representative 
values are an R factor of 11 for 3.5-inch-thick fiberglass batts (pieces) of insulation and 
an R factor of 19 for 6.5-inch-thick fiberglass batts ({link]). In the US, walls are usually 
insulated with 3.5-inch batts, whereas ceilings are usually insulated with 6.5-inch batts. 
In cold climates, thicker batts may be used. 


The fiberglass batt is used for insulation of walls and 
ceilings to prevent heat transfer between the inside of 
the building and the outside environment. (credit: 
Tracey Nicholls) 


Note that in [link], most of the best thermal conductors—-silver, copper, gold, and 
aluminum—are also the best electrical conductors, because they contain many free 
electrons that can transport thermal energy. (Diamond, an electrical insulator, conducts 
heat by atomic vibrations.) Cooking utensils are typically made from good conductors, 
but the handles of those used on the stove are made from good insulators (bad 
conductors). 


Example: 
Two Conductors End to End 


A steel rod and an aluminum rod, each of diameter 1.00 cm and length 25.0 cm, are 
welded end to end. One end of the steel rod is placed in a large tank of boiling water at 
100 °C, while the far end of the aluminum rod is placed in a large tank of water at 

20 °C. The rods are insulated so that no heat escapes from their surfaces. What is the 
temperature at the joint, and what is the rate of heat conduction through this composite 
rod? 

Strategy 

The heat that enters the steel rod from the boiling water has no place to go but through 
the steel rod, then through the aluminum rod, to the cold water. Therefore, we can 
equate the rate of conduction through the steel to the rate of conduction through the 
aluminum. 

We repeat the calculation with a second method, in which we use the thermal resistance 
R of the rod, since it simply adds when two rods are joined end to end. (We will use a 
similar method in the chapter on direct-current circuits.) 

Solution 


1. Identify the knowns and convert them to SI units. 
The length of each rod is Lay = Lgtece) = 0.25 m, the cross-sectional area of each 
rod is Aa = Astee) = 7.85 x 107° m?, the thermal conductivity of aluminum is 
kai = 220 W/m.- °C, the thermal conductivity of steel is kstee) = 80 W/m- °C, 
the temperature at the hot end is J’ = 100 °C, and the temperature at the cold end 
Tie Giga Wa Oe 

2. Calculate the heat-conduction rate through the steel rod and the heat-conduction 
rate through the aluminum rod in terms of the unknown temperature T at the joint: 


Equation: 
Rivage 
ed = steel ey steel 
__ (80 W/m-*C) (7.85 x 10°-° m”) (100 °C—T) 
=e 0.25 m 
= (0.0251 W/*C) (100 °C — T); 
Equation: 
ky Am AT. 
Pay = ‘Al Te Al 
__ (220 W/m-°C)(7.85 x 107° m?) (T—20 °C) 


0.25m 
= (0.0691 W/°C) (T — 20°C). 


3. Set the two rates equal and solve for the unknown temperature: 
Equation: 


(0.0691 W/°C) (T — 20°C) (0.0251 W/°C) (100 °C — T) 
T 222A 3 °C: 


4. Calculate either rate: 
Equation: 


Poteet — (0025 IW (C00; CAC) VAT 
5. If desired, check your answer by calculating the other rate. 
Solution 


1. Recall that R = L/k. Now P = AAT/R, or AT = PR/A. 
2. We know that AT ytee) + AT; = 100 “C — 20 °C = 80 °C. We also know that 
Pstee1 = Pa), and we denote that rate of heat flow by P. Combine the equations: 


Equation: 
P Rsteel Ug Ra ° 
ee, 
A A 
Thus, we can simply add R factors. Now, P = EET 


3. Find the R, from the known quantities: 
Equation: 


Hie Osh eben OU 


steel 


and 
Equation: 


Hea ita adam. Cy WW. 


4. Substitute these values in to find P = 1.47 W as before. 


5. Determine AT for the aluminum rod (or for the steel rod) and use it to find T at 
the joint. 
Equation: 
PR Ay W(t 10S me CW 
AT, = al _ | ) ) = 21.3°C, 


A 7.85 x 107-> m2 


so T = 20 °C + 21.3 °C = 41.3 °C, as in Solution 1. 
6. If desired, check by determining AT for the other rod. 


Significance 

In practice, adding R values is common, as in calculating the R value of an insulated 
wall. In the analogous situation in electronics, the resistance corresponds to AR in this 
problem and is additive even when the areas are unequal, as is common in electronics. 
Our equation for heat conduction can be used only when the areas are equal; otherwise, 
we would have a problem in three-dimensional heat flow, which is beyond our scope. 


Note: 
Exercise: 


Problem: 


Check Your Understanding How does the rate of heat transfer by conduction 
change when all spatial dimensions are doubled? 


Solution: 


Because area is the product of two spatial dimensions, it increases by a factor of 
four when each dimension is doubled (Artin = (2d)? = 4d? = 4Ainitiat ). The 


distance, however, simply doubles. Because the temperature difference and the 
coefficient of thermal conductivity are independent of the spatial dimensions, the 
rate of heat transfer by conduction increases by a factor of four divided by two, or 


two: 
kAgina(Tn—Te) = k(4Aginai(Th—T)) = y kAgina(Tn—Te) 
deinal 2dinitial dinitial 


a SI A 


Conduction is caused by the random motion of atoms and molecules. As such, it is an 
ineffective mechanism for heat transport over macroscopic distances and short times. 
For example, the temperature on Earth would be unbearably cold during the night and 
extremely hot during the day if heat transport in the atmosphere were only through 
conduction. Also, car engines would overheat unless there was a more efficient way to 
remove excess heat from the pistons. The next module discusses the important heat- 
transfer mechanism in such situations. 


Convection 


In convection, thermal energy is carried by the large-scale flow of matter. It can be 
divided into two types. In forced convection, the flow is driven by fans, pumps, and the 
like. A simple example is a fan that blows air past you in hot surroundings and cools 


you by replacing the air heated by your body with cooler air. A more complicated 
example is the cooling system of a typical car, in which a pump moves coolant through 
the radiator and engine to cool the engine and a fan blows air to cool the radiator. 


In free or natural convection, the flow is driven by buoyant forces: hot fluid rises and 
cold fluid sinks because density decreases as temperature increases. The house in [link] 
is kept warm by natural convection, as is the pot of water on the stove in [link]. Ocean 
currents and large-scale atmospheric circulation, which result from the buoyancy of 
warm air and water, transfer hot air from the tropics toward the poles and cold air from 
the poles toward the tropics. (Earth’s rotation interacts with those flows, causing the 
observed eastward flow of air in the temperate zones.) 


Air heated by a so-called gravity furnace expands and rises, 
forming a convective loop that transfers energy to other parts 
of the room. As the air is cooled at the ceiling and outside 
walls, it contracts, eventually becoming denser than room air 
and sinking to the floor. A properly designed heating system 
using natural convection, like this one, can heat a home quite 
efficiently. 


Hot water rises 


Cooler 
water sinks 


Natural convection plays an important 
role in heat transfer inside this pot of 
water. Once conducted to the inside, heat 
transfer to other parts of the pot is mostly 
by convection. The hotter water expands, 
decreases in density, and rises to transfer 
heat to other regions of the water, while 
colder water sinks to the bottom. This 
process keeps repeating. 


Note: 
Natural convection like that of [link] and [link], but acting on rock in Earth’s mantle, 
drives plate tectonics that are the motions that have shaped Earth’s surface. 


Convection is usually more complicated than conduction. Beyond noting that the 
convection rate is often approximately proportional to the temperature difference, we 
will not do any quantitative work comparable to the formula for conduction. However, 
we can describe convection qualitatively and relate convection rates to heat and time. 
Air is a poor conductor, so convection dominates heat transfer by air. Therefore, the 
amount of available space for airflow determines whether air transfers heat rapidly or 
slowly. There is little heat transfer in a space filled with air with a small amount of other 
material that prevents flow. The space between the inside and outside walls of a typical 
American house, for example, is about 9 cm (3.5 in.)—large enough for convection to 
work effectively. The addition of wall insulation prevents airflow, so heat loss (or gain) 


is decreased. On the other hand, the gap between the two panes of a double-paned 
window is about 1 cm, which largely prevents convection and takes advantage of air’s 
low conductivity reduce heat loss. Fur, cloth, and fiberglass also take advantage of the 
low conductivity of air by trapping it in spaces too small to support convection ([link]). 


Many 
convection 
loops 


Air 
(cold) 


Fur is filled with air, breaking it up into 
many small pockets. Convection is very 
slow here, because the loops are so 
small. The low conductivity of air 
makes fur a very good lightweight 
insulator. 


Some interesting phenomena happen when convection is accompanied by a phase 
change. The combination allows us to cool off by sweating even if the temperature of 
the surrounding air exceeds body temperature. Heat from the skin is required for sweat 
to evaporate from the skin, but without air flow, the air becomes saturated and 
evaporation stops. Air flow caused by convection replaces the saturated air by dry air 
and evaporation continues. 


Example: 

Calculating the Flow of Mass during Convection 

The average person produces heat at the rate of about 120 W when at rest. At what rate 
must water evaporate from the body to get rid of all this energy? (For simplicity, we 
assume this evaporation occurs when a person is sitting in the shade and surrounding 
temperatures are the same as skin temperature, eliminating heat transfer by other 
methods.) 

Strategy 

Energy is needed for this phase change (Q = mL,). Thus, the energy loss per unit time 
is 

Equation: 


ii 
Q — = 120 W = 120 J/s. 


t 


We divide both sides of the equation by L, to find that the mass evaporated per unit 
time is 


Equation: 
m _ 120J/s 
- 1 Si 
Solution 
Insert the value of the latent heat from [link], L, = 2430 kJ/kg = 2430 J/g. This 
yields 
Equation: 
120 J 
ihe ib = 0.0494 g/s = 2.96 g/min. 
t  2430J/¢ 
Significance 


Evaporating about 3 g/min seems reasonable. This would be about 180 g (about 7 oz.) 
per hour. If the air is very dry, the sweat may evaporate without even being noticed. A 
significant amount of evaporation also takes place in the lungs and breathing passages. 


Another important example of the combination of phase change and convection occurs 
when water evaporates from the oceans. Heat is removed from the ocean when water 
evaporates. If the water vapor condenses in liquid droplets as clouds form, possibly far 
from the ocean, heat is released in the atmosphere. Thus, there is an overall transfer of 
heat from the ocean to the atmosphere. This process is the driving power behind 
thunderheads, those great cumulus clouds that rise as much as 20.0 km into the 
stratosphere ((link]). Water vapor carried in by convection condenses, releasing 


tremendous amounts of energy. This energy causes the air to expand and rise to colder 
altitudes. More condensation occurs in these regions, which in turn drives the cloud 
even higher. This mechanism is an example of positive feedback, since the process 
reinforces and accelerates itself. It sometimes produces violent storms, with lightning 
and hail. The same mechanism drives hurricanes. 


Note: 
This time-lapse video shows convection currents in a thunderstorm, including “rolling” 
motion similar to that of boiling water. 


Cumulus clouds are caused by water vapor that rises 
because of convection. The rise of clouds is driven by a 
positive feedback mechanism. (credit: 
“Amada44”/Wikimedia Commons) 


Note: 
Exercise: 


Problem: 


Check Your Understanding Explain why using a fan in the summer feels 
refreshing. 


Solution: 


Using a fan increases the flow of air: Warm air near your body is replaced by 
cooler air from elsewhere. Convection increases the rate of heat transfer so that 
moving air “feels” cooler than still air. 


Radiation 


You can feel the heat transfer from the Sun. The space between Earth and the Sun is 
largely empty, so the Sun warms us without any possibility of heat transfer by 
convection or conduction. Similarly, you can sometimes tell that the oven is hot without 
touching its door or looking inside—it may just warm you as you walk by. In these 
examples, heat is transferred by radiation ([link]). That is, the hot body emits 
electromagnetic waves that are absorbed by the skin. No medium is required for 
electromagnetic waves to propagate. Different names are used for electromagnetic 
waves of different wavelengths: radio waves, microwaves, infrared radiation, visible 
light, ultraviolet radiation, X-rays, and gamma rays. 


Most of the heat transfer from this fire to the observers 


occurs through infrared radiation. The visible light, 
although dramatic, transfers relatively little thermal 
energy. Convection transfers energy away from the 
observers as hot air rises, while conduction is 
negligibly slow here. Skin is very sensitive to infrared 
radiation, so you can sense the presence of a fire 
without looking at it directly. (credit: Daniel O’ Neil) 


The energy of electromagnetic radiation varies over a wide range, depending on the 
wavelength: A shorter wavelength (or higher frequency) corresponds to a higher energy. 
Because more heat is radiated at higher temperatures, higher temperatures produce more 
intensity at every wavelength but especially at shorter wavelengths. In visible light, 
wavelength determines color—red has the longest wavelength and violet the shortest— 
so a temperature change is accompanied by a color change. For example, an electric 
heating element on a stove glows from red to orange, while the higher-temperature steel 
in a blast furnace glows from yellow to white. Infrared radiation is the predominant 
form radiated by objects cooler than the electric element and the steel. The radiated 
energy as a function of wavelength depends on its intensity, which is represented in 
[link] by the height of the distribution. (Electromagnetic Waves explains more about the 
electromagnetic spectrum, and Photons and Matter Waves discusses why the decrease in 
wavelength corresponds to an increase in energy.) 
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(a) A graph of the spectrum of electromagnetic waves emitted from an ideal 
radiator at three different temperatures. The intensity or rate of radiation emission 
increases dramatically with temperature, and the spectrum shifts down in 
wavelength toward the visible and ultraviolet parts of the spectrum. The shaded 
portion denotes the visible part of the spectrum. It is apparent that the shift toward 
the ultraviolet with temperature makes the visible appearance shift from red to 
white to blue as temperature increases. (b) Note the variations in color 
corresponding to variations in flame temperature. 


The rate of heat transfer by radiation also depends on the object’s color. Black is the 
most effective, and white is the least effective. On a clear summer day, black asphalt in a 
parking lot is hotter than adjacent gray sidewalk, because black absorbs better than gray 
([link]). The reverse is also true—black radiates better than gray. Thus, on a clear 
summer night, the asphalt is colder than the gray sidewalk, because black radiates the 
energy more rapidly than gray. A perfectly black object would be an ideal radiator and 
an ideal absorber, as it would capture all the radiation that falls on it. In contrast, a 
perfectly white object or a perfect mirror would reflect all radiation, and a perfectly 
transparent object would transmit it all ({link]). Such objects would not emit any 
radiation. Mathematically, the color is represented by the emissivity e. A “blackbody” 
radiator would have an e = 1, whereas a perfect reflector or transmitter would have 


e = 0. For real examples, tungsten light bulb filaments have an e of about 0.5, and 
carbon black (a material used in printer toner) has an emissivity of about 0.95. 


The darker pavement is hotter than the lighter pavement (much more of the ice on 
the right has melted), although both have been in the sunlight for the same time. 
The thermal conductivities of the pavements are the same. 
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A black object is a good absorber and a good radiator, whereas a white, clear, or 
silver object is a poor absorber and a poor radiator. 


To see that, consider a silver object and a black object that can exchange heat by 
radiation and are in thermal equilibrium. We know from experience that they will stay in 
equilibrium (the result of a principle that will be discussed at length in Second Law of 
Thermodynamics). For the black object’s temperature to stay constant, it must emit as 
much radiation as it absorbs, so it must be as good at radiating as absorbing. Similar 
considerations show that the silver object must radiate as little as it absorbs. Thus, one 
property, emissivity, controls both radiation and absorption. 


Finally, the radiated heat is proportional to the object’s surface area, since every part of 
the surface radiates. If you knock apart the coals of a fire, the radiation increases 
noticeably due to an increase in radiating surface area. 


The rate of heat transfer by emitted radiation is described by the Stefan-Boltzmann law 
of radiation: 
Equation: 


P =oAeT", 


where o = 5.67 x 10-8 J/s- m7? - K‘ is the Stefan-Boltzmann constant, a 
combination of fundamental constants of nature; A is the surface area of the object; and 
T is its temperature in kelvins. 


The proportionality to the fourth power of the absolute temperature is a remarkably 
strong temperature dependence. It allows the detection of even small temperature 
variations. Images called thermographs can be used medically to detect regions of 
abnormally high temperature in the body, perhaps indicative of disease. Similar 
techniques can be used to detect heat leaks in homes ([link]), optimize performance of 
blast furnaces, improve comfort levels in work environments, and even remotely map 
Earth’s temperature profile. 


A thermograph of part of a building shows temperature variations, 
indicating where heat transfer to the outside is most severe. Windows are 
a major region of heat transfer to the outside of homes. (credit: US 
Army) 


The Stefan-Boltzmann equation needs only slight refinement to deal with a simple case 
of an object’s absorption of radiation from its surroundings. Assuming that an object 
with a temperature 7), is surrounded by an environment with uniform temperature T%, 
the net rate of heat transfer by radiation is 


Note: 
Equation: 


Pret = oeA (T2* — T,*), 


where e is the emissivity of the object alone. In other words, it does not matter whether 
the surroundings are white, gray, or black: The balance of radiation into and out of the 
object depends on how well it emits and absorbs radiation. When T) > 7}, the quantity 
Pret is positive, that is, the net heat transfer is from hot to cold. 


Before doing an example, we have a complication to discuss: different emissivities at 
different wavelengths. If the fraction of incident radiation an object reflects is the same 
at all visible wavelengths, the object is gray; if the fraction depends on the wavelength, 
the object has some other color. For instance, a red or reddish object reflects red light 
more strongly than other visible wavelengths. Because it absorbs less red, it radiates less 
red when hot. Differential reflection and absorption of wavelengths outside the visible 
range have no effect on what we see, but they may have physically important effects. 
Skin is a very good absorber and emitter of infrared radiation, having an emissivity of 
0.97 in the infrared spectrum. Thus, in spite of the obvious variations in skin color, we 
are all nearly black in the infrared. This high infrared emissivity is why we can so easily 
feel radiation on our skin. It is also the basis for the effectiveness of night-vision scopes 
used by law enforcement and the military to detect human beings. 


Example: 

Calculating the Net Heat Transfer of a Person 

What is the rate of heat transfer by radiation of an unclothed person standing in a dark 
room whose ambient temperature is 22.0 °C? The person has a normal skin 
temperature of 33.0 °C and a surface area of 1.50 m?. The emissivity of skin is 0.97 in 
the infrared, the part of the spectrum where the radiation takes place. 

Strategy 

We can solve this by using the equation for the rate of radiative heat transfer. 
Solution 

Insert the temperature values Ty = 295 K and T; = 306 K, so that 

Equation: 


2 =ceA (Ts; —T;') 


(5.67 x 10-8 J/s-m? -K*) (0.97) (1.50 m?) |(295 K)* — (306 K)*| 
= -99J/s = —99 W. 


Significance 

This value is a significant rate of heat transfer to the environment (note the minus sign), 
considering that a person at rest may produce energy at the rate of 125 W and that 
conduction and convection are also transferring energy to the environment. Indeed, we 
would probably expect this person to feel cold. Clothing significantly reduces heat 
transfer to the environment by all mechanisms, because clothing slows down both 


conduction and convection, and has a lower emissivity (especially if it is light-colored) 
than skin. 


The average temperature of Earth is the subject of much current discussion. Earth is in 
radiative contact with both the Sun and dark space, so we cannot use the equation for an 
environment at a uniform temperature. Earth receives almost all its energy from 
radiation of the Sun and reflects some of it back into outer space. Conversely, dark space 
is very cold, about 3 K, so that Earth radiates energy into the dark sky. The rate of heat 
transfer from soil and grasses can be so rapid that frost may occur on clear summer 
evenings, even in warm latitudes. 


The average temperature of Earth is determined by its energy balance. To a first 
approximation, it is the temperature at which Earth radiates heat to space as fast as it 
receives energy from the Sun. 


An important parameter in calculating the temperature of Earth is its emissivity (e). On 
average, it is about 0.65, but calculation of this value is complicated by the great day-to- 
day variation in the highly reflective cloud coverage. Because clouds have lower 
emissivity than either oceans or land masses, they reflect some of the radiation back to 
the surface, greatly reducing heat transfer into dark space, just as they greatly reduce 
heat transfer into the atmosphere during the day. There is negative feedback (in which a 
change produces an effect that opposes that change) between clouds and heat transfer; 
higher temperatures evaporate more water to form more clouds, which reflect more 
radiation back into space, reducing the temperature. 


The often-mentioned greenhouse effect is directly related to the variation of Earth’s 
emissivity with wavelength ({link]). The greenhouse effect is a natural phenomenon 
responsible for providing temperatures suitable for life on Earth and for making Venus 
unsuitable for human life. Most of the infrared radiation emitted from Earth is absorbed 
by carbon dioxide (CO2) and water (H2O) in the atmosphere and then re-radiated into 
outer space or back to Earth. Re-radiation back to Earth maintains its surface 
temperature about 40 °C higher than it would be if there were no atmosphere. (The 
glass walls and roof of a greenhouse increase the temperature inside by blocking 
convective heat losses, not radiative losses.) 


IR 


The greenhouse effect is the name given to the increase of Earth’s 
temperature due to absorption of radiation in the atmosphere. The 
atmosphere is transparent to incoming visible radiation and most of the 
Sun’s infrared. The Earth absorbs that energy and re-emits it. Since Earth’s 
temperature is much lower than the Sun’s, it re-emits the energy at much 
longer wavelengths, in the infrared. The atmosphere absorbs much of that 
infrared radiation and radiates about half of the energy back down, keeping 
Earth warmer than it would otherwise be. The amount of trapping depends 
on concentrations of trace gases such as carbon dioxide, and an increase in 
the concentration of these gases increases Earth’s surface temperature. 


The greenhouse effect is central to the discussion of global warming due to emission of 
carbon dioxide and methane (and other greenhouse gases) into Earth’s atmosphere from 
industry, transportation, and farming. Changes in global climate could lead to more 


intense storms, precipitation changes (affecting agriculture), reduction in rain forest 
biodiversity, and rising sea levels. 


Note: 

You can explore a simulation of the greenhouse effect that takes the point of view that 
the atmosphere scatters (redirects) infrared radiation rather than absorbing it and 
reradiating it. You may want to run the simulation first with no greenhouse gases in the 
atmosphere and then look at how adding greenhouse gases affects the infrared radiation 
from the Earth and the Earth’s temperature. 


Note: 
Problem-Solving Strategy: Effects of Heat Transfer 


1. Examine the situation to determine what type of heat transfer is involved. 

2. Identify the type(s) of heat transfer—conduction, convection, or radiation. 

3. Identify exactly what needs to be determined in the problem (identify the 
unknowns). A written list is useful. 

4. Make a list of what is given or what can be inferred from the problem as stated 
(identify the knowns). 

5. Solve the appropriate equation for the quantity to be determined (the unknown). 

6. For conduction, use the equation P = —— [link] lists thermal conductivities. 
For convection, determine the amount of matter moved and the equation 
Q = mcAT, along with Q = mL; or Q = mLy if a substance changes phase. 
For radiation, the equation Pres = ce A (T>4 = Hi) gives the net heat transfer 
rate. 

7. Substitute the knowns along with their units into the appropriate equation and 
obtain numerical solutions complete with units. 

8. Check the answer to see if it is reasonable. Does it make sense? 


Note: 
Exercise: 


Problem: 


Check Your Understanding How much greater is the rate of heat radiation when 
a body is at the temperature 40 °C than when it is at the temperature 20 °C? 


Solution: 


The radiated heat is proportional to the fourth power of the absolute temperature. 
Because T; = 293 K and T2 = 313 K, the rate of heat transfer increases by about 
30% of the original rate. 


Summary 


e Heat is transferred by three different methods: conduction, convection, and 
radiation. 

e Heat conduction is the transfer of heat between two objects in direct contact with 
each other. 

e The rate of heat transfer P (energy per unit time) is proportional to the temperature 
difference 7}, — T, and the contact area A and inversely proportional to the 
distance d between the objects. 

e Convection is heat transfer by the macroscopic movement of mass. Convection can 
be natural or forced, and generally transfers thermal energy faster than conduction. 
Convection that occurs along with a phase change can transfer energy from cold 
regions to warm ones. 

e Radiation is heat transfer through the emission or absorption of electromagnetic 
waves. 

e The rate of radiative heat transfer is proportional to the emissivity e. For a perfect 
blackbody, e = 1, whereas a perfectly white, clear, or reflective body has e = 0, 
with real objects having values of e between 1 and 0. 

e The rate of heat transfer depends on the surface area and the fourth power of the 
absolute temperature: 

Equation: 


P =ceAT"*, 


where o = 5.67 x 107° J/s- m? - K* is the Stefan-Boltzmann constant and e is 
the emissivity of the body. The net rate of heat transfer from an object by radiation 
is 

Equation: 


os =oeA (T>* = ieee 


where 7) is the temperature of the object surrounded by an environment with 


uniform temperature 7» and e is the emissivity of the object. 


Key Equations 
Linear thermal expansion AL = aLAT 
Thermal expansion in two dimensions AA = 20a AAT 
Thermal expansion in three dimensions AV = BVAT 
Heat transfer Q = mcAT 
Transfer of heat in a calorimeter Qeold + Qhot = 0 
Heat due to phase change (melting and freezing) Q=mL; 
oe " i change (evaporation and (ager y 
Rate of conductive heat transfer P= bee 
Net rate of heat transfer by radiation Pret = 7eA (T2* as T;*) 


Conceptual Questions 


Exercise: 
Problem: 
What are the main methods of heat transfer from the hot core of Earth to its 
surface? From Earth’s surface to outer space? 


Exercise: 


Problem: 


When our bodies get too warm, they respond by sweating and increasing blood 
circulation to the surface to transfer thermal energy away from the core. What 
effect will those processes have on a person in a 40.0-°C hot tub? 


Solution: 


Increasing circulation to the surface will warm the person, as the temperature of the 
water is warmer than human body temperature. Sweating will cause no evaporative 
cooling under water or in the humid air immediately above the tub. 


Exercise: 


Problem: 


Shown below is a cut-away drawing of a thermos bottle (also known as a Dewar 
flask), which is a device designed specifically to slow down all forms of heat 
transfer. Explain the functions of the various parts, such as the vacuum, the 
silvering of the walls, the thin-walled long glass neck, the rubber support, the air 
layer, and the stopper. 


Glass walls 
with silvered 
surfaces 


Air layer Spring 


centering 
device 


Container 
Hot or cold 


liquid 


Vacuum 


Rubber support 


Exercise: 
Problem: 
Some electric stoves have a flat ceramic surface with heating elements hidden 
beneath. A pot placed over a heating element will be heated, while the surface only 
a few centimeters away is safe to touch. Why is ceramic, with a conductivity less 


than that of a metal but greater than that of a good insulator, an ideal choice for the 
stove top? 


Solution: 

It spread the heat over the area above the heating elements, evening the 

temperature there, but does not spread the heat much beyond the heating elements. 
Exercise: 

Problem: 

Loose-fitting white clothing covering most of the body, shown below, is ideal for 


desert dwellers, both in the hot Sun and during cold evenings. Explain how such 
clothing is advantageous during both day and night. 


Exercise: 
Problem: 
One way to make a fireplace more energy-efficient is to have room air circulate 


around the outside of the fire box and back into the room. Detail the methods of 
heat transfer involved. 


Solution: 


Heat is conducted from the fire through the fire box to the circulating air and then 
convected by the air into the room (forced convection). 

Exercise: 
Problem: 


On cold, clear nights horses will sleep under the cover of large trees. How does this 
help them keep warm? 


Exercise: 
Problem: 


When watching a circus during the day in a large, dark-colored tent, you sense 
significant heat transfer from the tent. Explain why this occurs. 


Solution: 
The tent is heated by the Sun and transfers heat to you by all three processes, 
especially radiation. 
Exercise: 
Problem: 
Satellites designed to observe the radiation from cold (3 K) dark space have sensors 


that are shaded from the Sun, Earth, and the Moon and are cooled to very low 
temperatures. Why must the sensors be at low temperature? 


Exercise: 
Problem: 
Why are thermometers that are used in weather stations shielded from the 


sunshine? What does a thermometer measure if it is shielded from the sunshine? 
What does it measure if it is not? 


Solution: 
If shielded, it measures the air temperature. If not, it measures the combined effect 
of air temperature and net radiative heat gain from the Sun. 

Exercise: 
Problem: 
Putting a lid on a boiling pot greatly reduces the heat transfer necessary to keep it 
boiling. Explain why. 


Exercise: 


Problem: 


Your house will be empty for a while in cold weather, and you want to save energy 
and money. Should you turn the thermostat down to the lowest level that will 
protect the house from damage such as freezing pipes, or leave it at the normal 
temperature? (If you don’t like coming back to a cold house, imagine that a timer 
controls the heating system so the house will be warm when you get back.) Explain 
your answer. 


Solution: 


Turn the thermostat down. To have the house at the normal temperature, the heating 
system must replace all the heat that was lost. For all three mechanisms of heat 
transfer, the greater the temperature difference between inside and outside, the 
more heat is lost and must be replaced. So the house should be at the lowest 
temperature that does not allow freezing damage. 


Exercise: 
Problem: 
You pour coffee into an unlidded cup, intending to drink it 5 minutes later. You can 
add cream when you pour the cup or right before you drink it. (The cream is at the 
same temperature either way. Assume that the cream and coffee come into thermal 


equilibrium with each other very quickly.) Which way will give you hotter coffee? 
What feature of this question is different from the previous one? 


Exercise: 
Problem: 
Broiling is a method of cooking by radiation, which produces somewhat different 
results from cooking by conduction or convection. A gas flame or electric heating 


element produces a very high temperature close to the food and above it. Why is 
radiation the dominant heat-transfer method in this situation? 


Solution: 
Air is a good insulator, so there is little conduction, and the heated air rises, so there 
is little convection downward. 
Exercise: 
Problem: 


On a cold winter morning, why does the metal of a bike feel colder than the wood 
of a porch? 


Problems 


Exercise: 
Problem: 
(a) Calculate the rate of heat conduction through house walls that are 13.0 cm thick 
and have an average thermal conductivity twice that of glass wool. Assume there 
are no windows or doors. The walls’ surface area is 120 m? and their inside surface 


is at 18.0 °C, while their outside surface is at 5.00 “C. (b) How many 1-kW room 
heaters would be needed to balance the heat transfer due to conduction? 


Solution: 


a. 1.01 x 10° W; b. One 1-kilowatt room heater is needed. 

Exercise: 
Problem: 
The rate of heat conduction out of a window on a winter day is rapid enough to 
chill the air next to it. To see just how rapidly the windows transfer heat by 
conduction, calculate the rate of conduction in watts through a 3.00-m? window 
that is 0.634 cm thick (1/4 in.) if the temperatures of the inner and outer surfaces 


are 5.00 °C and —10.0 °C, respectively. (This rapid rate will not be maintained— 
the inner surface will cool, even to the point of frost formation.) 


Exercise: 
Problem: 
Calculate the rate of heat conduction out of the human body, assuming that the core 
internal temperature is 37.0 °C, the skin temperature is 34.0 °C, the thickness of 


the fatty tissues between the core and the skin averages 1.00 cm, and the surface 
area is 1.40 m?. 


Solution: 


84.0 W 


Exercise: 


Problem: 


Suppose you stand with one foot on ceramic flooring and one foot on a wool 
carpet, making contact over an area of 80.0 cm? with each foot. Both the ceramic 
and the carpet are 2.00 cm thick and are 10.0 °C on their bottom sides. At what 
rate must heat transfer occur from each foot to keep the top of the ceramic and 
carpet at 33.0 °C? 


Exercise: 
Problem: 
A man consumes 3000 kcal of food in one day, converting most of it to thermal 


energy to maintain body temperature. If he loses half this energy by evaporating 
water (through breathing and sweating), how many kilograms of water evaporate? 


Solution: 


2.59 kg 
Exercise: 
Problem: 
A firewalker runs across a bed of hot coals without sustaining burns. Calculate the 


heat transferred by conduction into the sole of one foot of a firewalker given that 
the bottom of the foot is a 3.00-mm-thick callus with a conductivity at the low end 
of the range for wood and its density is 300 kg/ m°. The area of contact is 

25.0 cm?, the temperature of the coals is 700 °C, and the time in contact is 1.00 s. 
Ignore the evaporative cooling of sweat. 


Exercise: 
Problem: 
(a) What is the rate of heat conduction through the 3.00-cm-thick fur of a large 
animal having a 1.40-m? surface area? Assume that the animal’s skin temperature 
is 32.0 °C, that the air temperature is —5.00 °C, and that fur has the same thermal 


conductivity as air. (b) What food intake will the animal need in one day to replace 
this heat transfer? 


Solution: 


a. 39.7 W; b. 820 kcal 


Exercise: 


Problem: 


A walrus transfers energy by conduction through its blubber at the rate of 150 W 
when immersed in —1.00 °C water. The walrus’s internal core temperature is 
37.0 °C, and it has a surface area of 2.00 m2. What is the average thickness of its 
blubber, which has the conductivity of fatty tissues without blood? 


Exercise: 
Problem: 
Compare the rate of heat conduction through a 13.0-cm-thick wall that has an area 
of 10.0 m? and a thermal conductivity twice that of glass wool with the rate of heat 


conduction through a 0.750-cm-thick window that has an area of 2.00 m?, 
assuming the same temperature difference across each. 


Solution: 


kA(T>—T. 
_ ——, so that 


= 
(Q/t)war  _ KwattAwandwindow (2X 0.042 J/s-m-°C) (10.0 m?) (0.750 x 10-?m) 
(Q/t) window _ kwindow A window @wall _ (0.84 J/s-m- *C)(2.00 m?) (13.0 x 10-*m) 
This gives 0.0288 wall: window, or 35:1 window: wall 
Exercise: 
Problem: 


Suppose a person is covered head to foot by wool clothing with average thickness 
of 2.00 cm and is transferring energy by conduction through the clothing at the rate 
of 50.0 W. What is the temperature difference across the clothing, given the surface 
area is 1.40 m?? 


Exercise: 
Problem: 
Some stove tops are smooth ceramic for easy cleaning. If the ceramic is 0.600 cm 
thick and heat conduction occurs through the same area and at the same rate as 


computed in [link], what is the temperature difference across it? Ceramic has the 
same thermal conductivity as glass and brick. 


Solution: 

Q _ kA(T-Ti) _ RAAT 

@ = SATB) MAT 

AT = Hit) — __(6.00x 10 m)(2256W) yg °G = 105 x 103K 


kA (0.84 J/s-m-°C) (1.54 x 10°? m?) 


Exercise: 


Problem: 


One easy way to reduce heating (and cooling) costs is to add extra insulation in the 
attic of a house. Suppose a single-story cubical house already had 15 cm of 
fiberglass insulation in the attic and in all the exterior surfaces. If you added an 
extra 8.0 cm of fiberglass to the attic, by what percentage would the heating cost of 
the house drop? Take the house to have dimensions 10 m by 15 m by 3.0 m. Ignore 
air infiltration and heat loss through windows and doors, and assume that the 
interior is uniformly at one temperature and the exterior is uniformly at another. 


Exercise: 


Problem: 


Many decisions are made on the basis of the payback period: the time it will take 
through savings to equal the capital cost of an investment. Acceptable payback 
times depend upon the business or philosophy one has. (For some industries, a 
payback period is as small as 2 years.) Suppose you wish to install the extra 
insulation in the preceding problem. If energy cost $1.00 per million joules and the 
insulation was $4.00 per square meter, then calculate the simple payback time. 
Take the average AT for the 120-day heating season to be 15.0 °C. 


Solution: 


We found in the preceding problem that P = 126AT' W- °C as baseline energy 
use. So the total heat loss during this period is 

Q = (126 J/s- °C) (15.0 °C) (120 days) (86.4 x 10°s/day) = 1960 x 10°J 
. At the cost of $1/MJ, the cost is $1960. From an earlier problem, the savings is 
12% or $235/y. We need 150 m? of insulation in the attic. At $4/m/?, this is a $500 
cost. So the payback period is $600/ ($235/y) = 2.6 years (excluding labor 
costs). 


Additional Problems 


Exercise: 
Problem: 
In 1701, the Danish astronomer Ole Rgmer proposed a temperature scale with two 


fixed points, freezing water at 7.5 degrees, and boiling water at 60.0 degrees. What 
is the boiling point of oxygen, 90.2 K, on the Romer scale? 


Exercise: 


Problem: 


What is the percent error of thinking the melting point of tungsten is 3695 °C 
instead of the correct value of 3695 K? 


Solution: 


7.39% 
Exercise: 
Problem: 
An engineer wants to design a structure in which the difference in length between a 


steel beam and an aluminum beam remains at 0.500 m regardless of temperature, 
for ordinary temperatures. What must the lengths of the beams be? 


Exercise: 
Problem: 


How much stress is created in a steel beam if its temperature changes from —15 °C 
to 40 °C but it cannot expand? For steel, the Young’s modulus 


Y =210:* 10" N/ m’ from Stress, Strain, and Elastic Modulus. (Ignore the 
change in area resulting from the expansion.) 


Solution: 


4 = (210 x 10°Pa) (12 x 10°°/°C) (40°C — (—15 °C)) = 1.4 x 10° N/m’ 


Exercise: 


Problem: 


A brass rod (Y = 90 x 10°N/ m’), with a diameter of 0.800 cm and a length of 


1.20 m when the temperature is 25 °C, is fixed at both ends. At what temperature 
is the force in it at 36,000 N? 


Exercise: 


Problem: 


A mercury thermometer still in use for meteorology has a bulb with a volume of 
0.780 cm? and a tube for the mercury to expand into of inside diameter 0.130 mm. 
(a) Neglecting the thermal expansion of the glass, what is the spacing between 
marks 1 °C apart? (b) If the thermometer is made of ordinary glass (not a good 
idea), what is the spacing? 


Solution: 


a. 1.06 cm; b. 1.11 cm 
Exercise: 


Problem: 


Even when shut down after a period of normal use, a large commercial nuclear 
reactor transfers thermal energy at the rate of 150 MW by the radioactive decay of 
fission products. This heat transfer causes a rapid increase in temperature if the 
cooling system fails (1 watt = 1 joule/second or 1 W = 1 J/s and 

1 MW = 1 megawatt). (a) Calculate the rate of temperature increase in degrees 
Celsius per second (°C/s) if the mass of the reactor core is 1.60 x 10° kg and it 
has an average specific heat of 0.3349 kJ/kg - °C. (b) How long would it take to 
obtain a temperature increase of 2000 °C, which could cause some metals holding 
the radioactive materials to melt? (The initial rate of temperature increase would be 
greater than that calculated here because the heat transfer is concentrated in a 
smaller mass. Later, however, the temperature increase would slow down because 
the 500,000-kg steel containment vessel would also begin to heat up.) 


Exercise: 


Problem: 


You leave a pastry in the refrigerator on a plate and ask your roommate to take it 
out before you get home so you can eat it at room temperature, the way you like it. 
Instead, your roommate plays video games for hours. When you return, you notice 
that the pastry is still cold, but the game console has become hot. Annoyed, and 
knowing that the pastry will not be good if it is microwaved, you warm up the 
pastry by unplugging the console and putting it in a clean trash bag (which acts as a 
perfect calorimeter) with the pastry on the plate. After a while, you find that the 
equilibrium temperature is a nice, warm 38.3 “C. You know that the game console 
has a mass of 2.1 kg. Approximate it as having a uniform initial temperature of 

45 °C. The pastry has a mass of 0.16 kg and a specific heat of 3.0k J/(kg - °C), 
and is at a uniform initial temperature of 4.0 °C. The plate is at the same 
temperature and has a mass of 0.24 kg and a specific heat of 0.90 J/(kg - °C). 
What is the specific heat of the console? 


Solution: 


1.7 kJ/(kg - °C) 
Exercise: 


Problem: 


Two solid spheres, A and B, made of the same material, are at temperatures of 0 °C 
and 100 °C, respectively. The spheres are placed in thermal contact in an ideal 
calorimeter, and they reach an equilibrium temperature of 20 °C. Which is the 
bigger sphere? What is the ratio of their diameters? 


Exercise: 


Problem: 


In some countries, liquid nitrogen is used on dairy trucks instead of mechanical 
refrigerators. A 3.00-hour delivery trip requires 200 L of liquid nitrogen, which has 
a density of 808 kg/ m°. (a) Calculate the heat transfer necessary to evaporate this 
amount of liquid nitrogen and raise its temperature to 3.00 °C. (Use cp and assume 
it is constant over the temperature range.) This value is the amount of cooling the 
liquid nitrogen supplies. (b) What is this heat transfer rate in kilowatt-hours? (c) 
Compare the amount of cooling obtained from melting an identical mass of 0-° C 
ice with that from evaporating the liquid nitrogen. 


Solution: 


a. 1.57 x 10*kcal; b. 18.3kW -h;c. 1.29 x 104kcal 


Exercise: 
Problem: 
Some gun fanciers make their own bullets, which involves melting lead and casting 


it into lead slugs. How much heat transfer is needed to raise the temperature and 
melt 0.500 kg of lead, starting from 25.0 °C? 


Exercise: 
Problem: 
A 0.800-kg iron cylinder at a temperature of 1.00 x 10° °C is dropped into an 


insulated chest of 1.00 kg of ice at its melting point. What is the final temperature, 
and how much ice has melted? 


Solution: 


6.3 °C. All of the ice melted. 


Exercise: 


Problem: Repeat the preceding problem with 2.00 kg of ice instead of 1.00 kg. 
Exercise: 
Problem: 


Repeat the preceding problem with 0.500 kg of ice, assuming that the ice is initially 
in a copper container of mass 1.50 kg in equilibrium with the ice. 


Solution: 


63.9 °C, all the ice melted 
Exercise: 
Problem: 
A 30.0-g ice cube at its melting point is dropped into an aluminum calorimeter of 


mass 100.0 g in equilibrium at 24.0 °C with 300.0 g of an unknown liquid. The 
final temperature is 4.0 “C. What is the heat capacity of the liquid? 


Exercise: 


Problem: 


(a) Calculate the rate of heat conduction through a double-paned window that has a 
1.50-m? area and is made of two panes of 0.800-cm-thick glass separated by a 
1.00-cm air gap. The inside surface temperature is 15.0 °C, while that on the 
outside is —10.0 °C. (Hint: There are identical temperature drops across the two 
glass panes. First find these and then the temperature drop across the air gap. This 
problem ignores the increased heat transfer in the air gap due to convection.) (b) 
Calculate the rate of heat conduction through a 1.60-cm-thick window of the same 
area and with the same temperatures. Compare your answer with that for part (a). 


Solution: 


a. 83 W; b. 1.97 x 10° W; The single-pane window has a rate of heat conduction 
equal to 1969/83, or 24 times that of a double-pane window. 


Exercise: 


Problem: 


(a) An exterior wall of a house is 3 m tall and 10 m wide. It consists of a layer of 
drywall with an R factor of 0.56, a layer 3.5 inches thick filled with fiberglass batts, 
and a layer of insulated siding with an R factor of 2.6. The wall is built so well that 
there are no leaks of air through it. When the inside of the wall is at 22 °C and the 
outside is at —2 °C, what is the rate of heat flow through the wall? (b) More 
realistically, the 3.5-inch space also contains 2-by-4 studs—wooden boards 1.5 
inches by 3.5 inches oriented so that 3.5-inch dimension extends from the drywall 
to the siding. They are “on 16-inch centers,” that is, the centers of the studs are 16 
inches apart. What is the heat current in this situation? Don’t worry about one stud 
more or less. 


Exercise: 
Problem: 
For the human body, what is the rate of heat transfer by conduction through the 
body’s tissue with the following conditions: the tissue thickness is 3.00 cm, the 
difference in temperature is 2.00 °C, and the skin area is 1.50 m7”. How does this 


compare with the average heat transfer rate to the body resulting from an energy 
intake of about 2400 kcal per day? (No exercise is included.) 


Solution: 


The rate of heat transfer by conduction is 20.0 W. On a daily basis, this is 1,728 
kJ/day. Daily food intake is 2400 kcal/d x 4186 J/kcal = 10,050kJ/day. So 


only 17.2% of energy intake goes as heat transfer by conduction to the environment 
at this AT’. 


Exercise: 


Problem: 


You have a Dewar flask (a laboratory vacuum flask) that has an open top and 
straight sides, as shown below. You fill it with water and put it into the freezer. It is 
effectively a perfect insulator, blocking all heat transfer, except on the top. After a 
time, ice forms on the surface of the water. The liquid water and the bottom surface 
of the ice, in contact with the liquid water, are at 0 “C. The top surface of the ice is 
at the same temperature as the air in the freezer, —18 °C. Set the rate of heat flow 
through the ice equal to the rate of loss of heat of fusion as the water freezes. When 
the ice layer is 0.700 cm thick, find the rate in m/s at which the ice is thickening. 


-18°C 


0°C 


Exercise: 
Problem: 
An infrared heater for a sauna has a surface area of 0.050 m? and an emissivity of 
0.84. What temperature must it run at if the required power is 360 W? Neglect the 
temperature of the environment. 


Solution: 


620 K 


Exercise: 


Problem: 


(a) Determine the power of radiation from the Sun by noting that the intensity of 
the radiation at the distance of Earth is 1370 W/ m’”. Hint: That intensity will be 
found everywhere on a spherical surface with radius equal to that of Earth’s orbit. 
(b) Assuming that the Sun’s temperature is 5780 K and that its emissivity is 1, find 
its radius. 


Challenge Problems 


Exercise: 


Problem: 


A pendulum is made of a rod of length L and negligible mass, but capable of 
thermal expansion, and a weight of negligible size. (a) Show that when the 
temperature increases by dT, the period of the pendulum increases by a fraction 
aLdT /2. (b) A clock controlled by a brass pendulum keeps time correctly at 

10 °C. If the room temperature is 30 °C, does the clock run faster or slower? What 
is its error in seconds per day? 


Solution: 


Denoting the period by P, we know P = 27/ L/g. When the temperature 
increases by dT, the length increases by aLdT’. Then the new length is a. 


P = 2ny/ Heald — Ony/L (1 + ad) = 24/4 (1+ badT) = P(1+ daar) 


by the binomial expansion. b. The clock runs slower, as its new period is 1.00019 s. 
It loses 16.4 s per day. 


Exercise: 
Problem: 
At temperatures of a few hundred kelvins the specific heat capacity of copper 
approximately follows the empirical formula c = a + BT + 6T ~, where 
a = 349 J/kg -K, 6 = 0.107 J/kg - K?, andd = 4.58 x 10°J-kge- K. How 


much heat is needed to raise the temperature of a 2.00-kg piece of copper from 
20 °C to 250 °C? 


Exercise: 


Problem: 


In a calorimeter of negligible heat capacity, 200 g of steam at 150 °C and 100 g of 
ice at —40 °C are mixed. The pressure is maintained at 1 atm. What is the final 
temperature, and how much steam, ice, and water are present? 


Solution: 


The amount of heat to melt the ice and raise it to 100 °C is not enough to condense 
the steam, but it is more than enough to lower the steam’s temperature by 50 °C, so 
the final state will consist of steam and liquid water in equilibrium, and the final 
temperature is 100 °C; 9.5 g of steam condenses, so the final state contains 49.5 g 
of steam and 40.5 g of liquid water. 


Exercise: 


Problem: 


An astronaut performing an extra-vehicular activity (space walk) shaded from the 
Sun is wearing a spacesuit that can be approximated as perfectly white (e = 0) 
except fora5cm x 8cm patch in the form of the astronaut’s national flag. The 
patch has emissivity 0.300. The spacesuit under the patch is 0.500 cm thick, with a 
thermal conductivity k = 0.0600 W/m °C, and its inner surface is at a 
temperature of 20.0 °C. What is the temperature of the patch, and what is the rate 
of heat loss through it? Assume the patch is so thin that its outer surface is at the 
same temperature as the outer surface of the spacesuit under it. Also assume the 
temperature of outer space is 0 K. You will get an equation that is very hard to 
solve in closed form, so you can solve it numerically with a graphing calculator, 
with software, or even by trial and error with a calculator. 


Exercise: 


Problem: 


Find the growth of an ice layer as a function of time in a Dewar flask as seen in 
[link]. Call the thickness of the ice layer L. (a) Derive an equation for dL/dt in 
terms of L , the temperature T above the ice, and the properties of ice (which you 
can leave in symbolic form instead of substituting the numbers). (b) Solve this 
differential equation assuming that at ¢ = 0, you have L = 0. If you have studied 
differential equations, you will know a technique for solving equations of this type: 
manipulate the equation to get dL/dt multiplied by a (very simple) function of L on 
one side, and integrate both sides with respect to time. Alternatively, you may be 
able to use your knowledge of the derivatives of various functions to guess the 
solution, which has a simple dependence on t. (c) Will the water eventually freeze 
to the bottom of the flask? 


Solution: 


a. dL /dT = kT /pL;b. L = \/2kTt/pLs; c. yes 
Exercise: 


Problem: 


As the very first rudiment of climatology, estimate the temperature of Earth. 
Assume it is a perfect sphere and its temperature is uniform. Ignore the greenhouse 
effect. Thermal radiation from the Sun has an intensity (the “solar constant” S) of 
about 1370 W/ m’ at the radius of Earth’s orbit. (a) Assuming the Sun’s rays are 
parallel, what area must S be multiplied by to get the total radiation intercepted by 
Earth? It will be easiest to answer in terms of Earth’s radius, R. (b) Assume that 
Earth reflects about 30% of the solar energy it intercepts. In other words, Earth has 
an albedo with a value of A = 0.3. In terms of S, A, and R, what is the rate at 
which Earth absorbs energy from the Sun? (c) Find the temperature at which Earth 
radiates energy at the same rate. Assume that at the infrared wavelengths where it 
radiates, the emissivity e is 1. Does your result show that the greenhouse effect is 
important? (d) How does your answer depend on the the area of Earth? 


Exercise: 


Problem: 


Let’s stop ignoring the greenhouse effect and incorporate it into the previous 
problem in a very rough way. Assume the atmosphere is a single layer, a spherical 
shell around Earth, with an emissivity e = 0.77 (chosen simply to give the right 
answer) at infrared wavelengths emitted by Earth and by the atmosphere. However, 
the atmosphere is transparent to the Sun’s radiation (that is, assume the radiation is 
at visible wavelengths with no infrared), so the Sun’s radiation reaches the surface. 
The greenhouse effect comes from the difference between the atmosphere’s 
transmission of visible light and its rather strong absorption of infrared. Note that 
the atmosphere’s radius is not significantly different from Earth’s, but since the 
atmosphere is a layer above Earth, it emits radiation both upward and downward, 
so it has twice Earth’s area. There are three radiative energy transfers in this 
problem: solar radiation absorbed by Earth’s surface; infrared radiation from the 
surface, which is absorbed by the atmosphere according to its emissivity; and 
infrared radiation from the atmosphere, half of which is absorbed by Earth and half 
of which goes out into space. Apply the method of the previous problem to get an 
equation for Earth’s surface and one for the atmosphere, and solve them for the two 
unknown temperatures, surface and atmosphere. 


a. In terms of Earth’s radius, the constant o, and the unknown temperature T’, of 
the surface, what is the power of the infrared radiation from the surface? 


b. What is the power of Earth’s radiation absorbed by the atmosphere? 

c. In terms of the unknown temperature 7 of the atmosphere, what is the power 
radiated from the atmosphere? 

d. Write an equation that says the power of the radiation the atmosphere absorbs 
from Earth equals the power of the radiation it emits. 

e. Half of the power radiated by the atmosphere hits Earth. Write an equation 
that says that the power Earth absorbs from the atmosphere and the Sun equals 
the power that it emits. 

f. Solve your two equations for the unknown temperature of Earth. 

For steps that make this model less crude, see for example the lectures by Paul 
O’Gorman. 


Solution: 


a. 4 (nR?)T;*; b. deonR*T,*; c. 8eomR?T,*; dT} = 2T?; e. 
eoT! + $(1— A)S = oT; f. 288K 


Glossary 


conduction 
heat transfer through stationary matter by physical contact 


convection 
heat transfer by the macroscopic movement of fluid 


emissivity 
measure of how well an object radiates 


greenhouse effect 
warming of the earth that is due to gases such as carbon dioxide and methane that 
absorb infrared radiation from Earth’s surface and reradiate it in all directions, thus 
sending some of it back toward Earth 


net rate of heat transfer by radiation 
Pig = oeA (T>4 — T;*) 


radiation 
energy transferred by electromagnetic waves directly as a result of a temperature 
difference 


rate of conductive heat transfer 
rate of heat transfer from one material to another 


Stefan-Boltzmann law of radiation 
P = oc AeT*, where o = 5.67 x 10-8 J/s- m? - K‘ is the Stefan-Boltzmann 
constant, A is the surface area of the object, T is the absolute temperature, and e is 
the emissivity 


thermal conductivity 
property of a material describing its ability to conduct heat 


Introduction 
class="introduction" 


A volcanic eruption 
releases tons of gas and 
dust into the 
atmosphere. Most of 
the gas is water vapor, 
but several other gases 
are common, including 
greenhouse gases such 
as carbon dioxide and 
acidic pollutants such 
as sulfur dioxide. 
However, the emission 
of volcanic gas is not 
all bad: Many 
geologists believe that 
in the earliest stages of 
Earth’s formation, 
volcanic emissions 
formed the early 
atmosphere. (credit: 
modification of work 
by 
“Boaworm”/Wikimedi 
a Commons) 


Gases are literally all around us—the air that we breathe is a mixture of 
gases. Other gases include those that make breads and cakes soft, those that 
make drinks fizzy, and those that burn to heat many homes. Engines and 
refrigerators depend on the behaviors of gases, as we will see in later 
chapters. 


As we discussed in the preceding chapter, the study of heat and temperature 
is part of an area of physics known as thermodynamics, in which we require 
a system to be macroscopic, that is, to consist of a huge number (such as 
107%) of molecules. We begin by considering some macroscopic properties 
of gases: volume, pressure, and temperature. The simple model of a 
hypothetical “ideal gas” describes these properties of a gas very accurately 
under many conditions. We move from the ideal gas model to a more 
widely applicable approximation, called the Van der Waals model. 


To understand gases even better, we must also look at them on the 
microscopic scale of molecules. In gases, the molecules interact weakly, so 
the microscopic behavior of gases is relatively simple, and they serve as a 
good introduction to systems of many molecules. The molecular model of 
gases is called the kinetic theory of gases and is one of the classic examples 
of a molecular model that explains everyday behavior. 


Molecular Model of an Ideal Gas 
By the end of this section, you will be able to: 


e Apply the ideal gas law to situations involving the pressure, volume, temperature, and the number of 
molecules of a gas 

e Use the unit of moles in relation to numbers of molecules, and molecular and macroscopic masses 

e Explain the ideal gas law in terms of moles rather than numbers of molecules 

e Apply the van der Waals gas law to situations where the ideal gas law is inadequate 


In this section, we explore the thermal behavior of gases. Our word “gas” comes from the Flemish word 
meaning “chaos,” first used for vapors by the seventeenth-century chemist J. B. van Helmont. The term was 
more appropriate than he knew, because gases consist of molecules moving and colliding with each other at 
random. This randomness makes the connection between the microscopic and macroscopic domains 
simpler for gases than for liquids or solids. 


How do gases differ from solids and liquids? Under ordinary conditions, such as those of the air around us, 
the difference is that the molecules of gases are much farther apart than those of solids and liquids. Because 
the typical distances between molecules are large compared to the size of a molecule, as illustrated in [link], 
the forces between them are considered negligible, except when they come into contact with each other 
during collisions. Also, at temperatures well above the boiling temperature, the motion of molecules is fast, 
and the gases expand rapidly to occupy all of the accessible volume. In contrast, in liquids and solids, 
molecules are closer together, and the behavior of molecules in liquids and solids is highly constrained by 
the molecules’ interactions with one another. The macroscopic properties of such substances depend 
strongly on the forces between the molecules, and since many molecules are interacting, the resulting 
“many-body problems” can be extremely complicated (see Condensed Matter Physics). 


@ -—e 
= @ 
~) \ _— 
a oe. 
J » 
\ \ 
, @ 
. @ @ . 
—— « — «x 
\ 
\ 
e > . \ 


Atoms and molecules in a gas are typically widely 
separated. Because the forces between them are quite 
weak at these distances, the properties of a gas depend 
more on the number of atoms per unit volume and on 
temperature than on the type of atom. 


The Gas Laws 


In the previous chapter, we saw one consequence of the large intermolecular spacing in gases: Gases are 
easily compressed. [link] shows that gases have larger coefficients of volume expansion than either solids 
or liquids. These large coefficients mean that gases expand and contract very rapidly with temperature 
changes. We also saw (in the section on thermal expansion) that most gases expand at the same rate or have 
the same coefficient of volume expansion, {. This raises a question: Why do all gases act in nearly the same 
way, when all the various liquids and solids have widely varying expansion rates? 


To study how the pressure, temperature, and volume of a gas relate to one another, consider what happens 
when you pump air into a deflated car tire. The tire’s volume first increases in direct proportion to the 
amount of air injected, without much increase in the tire pressure. Once the tire has expanded to nearly its 
full size, the tire’s walls limit its volume expansion. If we continue to pump air into the tire, the pressure 
increases. When the car is driven and the tires flex, their temperature increases, and therefore the pressure 
increases even further ((link]). 


Increase 
temperature 


(b) (c) 


(a) When air is pumped into a deflated tire, its volume first increases without much increase in 
pressure. (b) When the tire is filled to a certain point, the tire walls resist further expansion, and the 
pressure increases with more air. (c) Once the tire is inflated, its pressure increases with temperature. 


[link] shows data from the experiments of Robert Boyle (1627-1691), illustrating what is now called 
Boyle’s law: At constant temperature and number of molecules, the absolute pressure of a gas and its 
volume are inversely proportional. (Recall from Fluid Mechanics that the absolute pressure is the true 
pressure and the gauge pressure is the absolute pressure minus the ambient pressure, typically atmospheric 
pressure.) The graph in [link] displays this relationship as an inverse proportionality of volume to pressure. 


Volume (arbitrary units) 


0 0.005 0.01 0.015 0.02 0.025 0.03 0.035 0.04 
1/pressure (inverse inches of mercury) 


Robert Boyle and his assistant found that volume and pressure are 
inversely proportional. Here their data are plotted as V versus 1/p; the 
linearity of the graph shows the inverse proportionality. The number shown 
as the volume is actually the height in inches of air in a cylindrical glass 
tube. The actual volume was that height multiplied by the cross-sectional 
area of the tube, which Boyle did not publish. The data are from Boyle’s 
book A Defence of the Doctrine Touching the Spring and Weight of the 
Air..., p. 60.[footnote] 
http://bvpb.mcu.es/en/consulta/registro.cmd?id=406806 


[link] shows experimental data illustrating what is called Charles’s law, after Jacques Charles (1746-1823). 
Charles’s law states that at constant pressure and number of molecules, the volume of a gas is proportional 
to its absolute temperature. 
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Experimental data showing that at constant pressure, volume is 
approximately proportional to temperature. The best-fit line passes 
approximately through the origin.[ footnote] 
http://chemed.chem.purdue.edu/genchem/history/charles.html 


Similar is Amonton’s or Gay-Lussac’s law, which states that at constant volume and number of molecules, 
the pressure is proportional to the temperature. That law is the basis of the constant-volume gas 
thermometer, discussed in the previous chapter. (The histories of these laws and the appropriate credit for 
them are more complicated than can be discussed here.) 


It is known experimentally that for gases at low density (such that their molecules occupy a negligible 
fraction of the total volume) and at temperatures well above the boiling point, these proportionalities hold 
to a good approximation. Not surprisingly, with the other quantities held constant, either pressure or volume 
is proportional to the number of molecules. More surprisingly, when the proportionalities are combined into 
a single equation, the constant of proportionality is independent of the composition of the gas. The resulting 
equation for all gases applies in the limit of low density and high temperature; it’s the same for oxygen as 
for helium or uranium hexafluoride. A gas at that limit is called an ideal gas; it obeys the ideal gas law, 
which is also called the equation of state of an ideal gas. 


Note: 

Ideal Gas Law 

The ideal gas law states that 
Equation: 


pV = NkpT, 


where p is the absolute pressure of a gas, V is the volume it occupies, N is the number of molecules in the 
gas, and T is its absolute temperature. 


The constant kg is called the Boltzmann constant in honor of the Austrian physicist Ludwig Boltzmann 
(1844-1906) and has the value 
Equation: 


kp = 1.38 x 107-33 /K. 


The ideal gas law describes the behavior of any real gas when its density is low enough or its temperature 
high enough that it is far from liquefaction. This encompasses many practical situations. In the next section, 
we’ ll see why it’s independent of the type of gas. 


In many situations, the ideal gas law is applied to a sample of gas with a constant number of molecules; for 
instance, the gas may be in a sealed container. If N is constant, then solving for N shows that pV/T is 
constant. We can write that fact in a convenient form: 


Note: 
Equation: 


piVi ie P2V2 
Ti Tr’ 


where the subscripts 1 and 2 refer to any two states of the gas at different times. Again, the temperature 
must be expressed in kelvin and the pressure must be absolute pressure, which is the sum of gauge pressure 
and atmospheric pressure. 


Example: 

Calculating Pressure Changes Due to Temperature Changes 

Suppose your bicycle tire is fully inflated, with an absolute pressure of 7.00 x 10° Pa (a gauge pressure 
of just under 90.0 lb /in.”) at a temperature of 18.0 “°C. What is the pressure after its temperature has risen 
to 35.0 °C ona hot day? Assume there are no appreciable leaks or changes in volume. 

Strategy 

The pressure in the tire is changing only because of changes in temperature. We know the initial pressure 
po = 7.00 x 10° Pa, the initial temperature Ty = 18.0 °C, and the final temperature T; = 35.0 °C. We 
must find the final pressure ps. Since the number of molecules is constant, we can use the equation 
Equation: 


PrV_ _ PoVo 
Tt Tan 


Since the volume is constant, V; and Vo are the same and they divide out. Therefore, 
Equation: 


We can then rearrange this to solve for pr : 
Equation: 


Pf = Po 


where the temperature must be in kelvin. 
Solution 


1. Convert temperatures from degrees Celsius to kelvin 
Equation: 


Ty = (18.0 + 273)K = 291K, 


Equation: 
T; = (35.0 + 273)K = 308 K. 


2. Substitute the known values into the equation, 
Equation: 


ip 308 K 
pe =n 700) 10> Pa (ae 
0 


= 7.41 x 10°Pa. 
ip 291K 


Significance 

The final temperature is about 6 % greater than the original temperature, so the final pressure is about 6 % 
greater as well. Note that absolute pressure (see Fluid Mechanics) and absolute temperature (see 
Temperature and Heat) must be used in the ideal gas law. 


Example: 

Calculating the Number of Molecules in a Cubic Meter of Gas 

How many molecules are in a typical object, such as gas in a tire or water in a glass? This calculation can 
give us an idea of how large N typically is. Let’s calculate the number of molecules in the air that a typical 
healthy young adult inhales in one breath, with a volume of 500 mL, at standard temperature and pressure 
(STP), which is defined as 0 °C and atmospheric pressure. (Our young adult is apparently outside in 
winter.) 

Strategy 

Because pressure, volume, and temperature are all specified, we can use the ideal gas law, pV = NkpT, to 
find N. 

Solution 


1. Identify the knowns. 
Equation: 


T =0°C = 273K, p=1.01 x 10°Pa, V = 500mL =5 x 10°* m’, kg = 1.38 x 10% IJ/K 


2. Substitute the known values into the equation and solve for N. 


Equation: 


1.01 x 10°Pa) (5 x 10-*m? 
IN = De = ( s a a a — 1.34 x 107? molecules 
kgT (1.38 x 10-*3 J/K) (273 K) 


Significance 

N is huge, even in small volumes. For example, 1 cm? of a gas at STP contains 2.68 x 10'° molecules. 
Once again, note that our result for N is the same for all types of gases, including mixtures. 

As we observed in the chapter on fluid mechanics, pascals are N/ m”, so Pa- m3 = N-m = J. Thus, our 
result for N is dimensionless, a pure number that could be obtained by counting (in principle) rather than 
measuring. As it is the number of molecules, we put “molecules” after the number, keeping in mind that it 
is an aid to communication rather than a unit. 


Moles and Avogadro’s Number 


It is often convenient to measure the amount of substance with a unit on a more human scale than 
molecules. The SI unit for this purpose was developed by the Italian scientist Amedeo Avogadro (1776— 
1856). (He worked from the hypothesis that equal volumes of gas at equal pressure and temperature contain 
equal numbers of molecules, independent of the type of gas. As mentioned above, this hypothesis has been 
confirmed when the ideal gas approximation applies.) A mole (abbreviated mol) is defined as the amount of 
any substance that contains as many molecules as there are atoms in exactly 12 grams (0.012 kg) of carbon- 
12. (Technically, we should say “formula units,” not “molecules,” but this distinction is irrelevant for our 
purposes.) The number of molecules in one mole is called Avogadro’s number (JV), and the value of 
Avogadro’s number is now known to be 

Equation: 


Na = 6.02 x 107? mol™?. 


We can now write V = Nan, where n represents the number of moles of a substance. 


Avogadro’s number relates the mass of an amount of substance in grams to the number of protons and 
neutrons in an atom or molecule (12 for a carbon-12 atom), which roughly determine its mass. It’s natural 
to define a unit of mass such that the mass of an atom is approximately equal to its number of neutrons and 
protons. The unit of that kind accepted for use with the SI is the unified atomic mass unit (u), also called the 
dalton. Specifically, a carbon-12 atom has a mass of exactly 12 u, so that its molar mass M in grams per 
mole is numerically equal to the mass of one carbon-12 atom in u. That equality holds for any substance. In 
other words, V4 is not only the conversion from numbers of molecules to moles, but it is also the 
conversion from u to grams: 6.02 x 107 u=1 g. See [link]. 


Mt. Everest 
(for scale) 


Table tennis balls 
yA 


How big is a mole? On a macroscopic level, Avogadro’s number of table tennis balls would 
cover Earth to a depth of about 40 km. 


Now letting m, stand for the mass of a sample of a substance, we have m, = nM. Letting m stand for the 
mass of a molecule, we have M = Nam. 


Note: 
Exercise: 


Problem: 


Check Your Understanding The recommended daily amount of vitamin Bs or niacin, CsNH;Oz, for 
women who are not pregnant or nursing, is 14 mg. Find the number of molecules of niacin in that 
amount. 


Solution: 


We first need to calculate the molar mass (the mass of one mole) of niacin. To do this, we must 
multiply the number of atoms of each element in the molecule by the element’s molar mass. 
(6 mol of carbon) (12.0 g/mol) + (5 mol hydrogen) (1.0 g/mol) 
+ (1 mol nitrogen) (14 g/mol) + (2 mol oxygen) (16.0 g/mol) = 123 g/mol 
Then we need to calculate the number of moles in 14 mg. 


14mg lg _ —4 
(5) ( soating ) = 1.14 x 1074 mol. 
Then, we use Avogadro’s number to calculate the number of molecules: 
N =nN, = (1.14 x 10~* mol) (6.02 x 107’ molecules/mol) = 6.85 x 10'% molecules. 


Note: 
Exercise: 


Problem: 


Check Your Understanding The density of air in a classroom (p = 1.00 atm and T = 20 °C) is 
1.28 kg/ m*, At what pressure is the density 0.600 kg/ m’ if the temperature is kept constant? 


Solution: 


The density of a gas is equal to a constant, the average molecular mass, times the number density N/V. 
From the ideal gas law, pV = NkgT, we see that N/V = p/kpT’. Therefore, at constant 
temperature, if the density and, consequently, the number density are reduced by half, the pressure 
must also be reduced by half, and pp = 0.500 atm. 


The Ideal Gas Law Restated using Moles 


A very common expression of the ideal gas law uses the number of moles in a sample, n, rather than the 
number of molecules, N. We start from the ideal gas law, 
Equation: 


pV = NkpT, 


and multiply and divide the right-hand side of the equation by Avogadro’s number Na. This gives us 
Equation: 


N 
V = —WNa kpT. 
Pp Ma A KB 


Note that n = N/N,4q is the number of moles. We define the universal gas constant as R = Nakp, and 
obtain the ideal gas law in terms of moles. 


Note: 
Ideal Gas Law (in terms of moles) 
In terms of number of moles n, the ideal gas law is written as 


Equation: 
pV =nkRT. 
In SI units, 
Equation: 
23 -1 —23 J J 
R = Nakg = (6.02 x 107 mol’) { 1.38 x 10 21:5)! 
K mol -K 


In other units, 
Equation: 


cal = te es 


De Lod ae mol- K" 


You can use whichever value of R is most convenient for a particular problem. 


Example: 
Density of Air at STP and in a Hot Air Balloon 


Calculate the density of dry air (a) under standard conditions and (b) in a hot air balloon at a temperature of 
120°C. Dry air is approximately 78 % No, 21% Oo, and 1% Ar. 
Strategy and Solution 


a. We are asked to find the density, or mass per cubic meter. We can begin by finding the molar mass. If 
we have a hundred molecules, of which 78 are nitrogen, 21 are oxygen, and 1 is argon, the average 

_ 78my,+21mo,+ . a ; 

molecular mass is a. or the mass of each constituent multiplied by its percentage. The 


same applies to the molar mass, which therefore is 
Equation: 


M = 0.78 My, + 0.21 Mo, + 0.01 Ma, = 29.0 g/mol. 


Now we can find the number of moles per cubic meter. We use the ideal gas law in terms of moles, 
pV —Thl, wih p — 00atm. . — 274k, V — 1m and — 8.3L J/mol - K. The most 
convenient choice for R in this case is R = 8.31 J/mol - K because the known quantities are in SI 
units: 

Equation: 


pV (1.01 x 10°Pa) (1m*) 
a a ee ee AA nol: 
RT (8.31 J/mol - K) (273K) 


Then, the mass m, of that air is 
Equation: 


m; = nM = (44.5 mol)(29.0 g/mol) = 1290 g = 1.29kg. 


Finally the density of air at STP is 
Equation: 
ms, _ 1.29kg 


77 ae = 1.29kg/m’. 
m 


p= 


b. The air pressure inside the balloon is still 1 atm because the bottom of the balloon is open to the 
atmosphere. The calculation is the same except that we use a temperature of 120 °C, which is 393 K. 
We can repeat the calculation in (a), or simply observe that the density is proportional to the number 
of moles, which is inversely proportional to the temperature. Then using the subscripts 1 for air at 
STP and 2 for the hot air, we have 
Equation: 


Hi 273K 


1.29 kg/m?) — 0.896 kg/m’. 
303K | g/m”) g/m 


Significance 

Using the methods of Archimedes’ Principle and Buoyancy, we can find that the net force on 2200 m? of 
air at 120°C is Fy — Fy = PatmosphereV 9 — Phot airVg = 8.49 x 10° N, or enough to lift about 867 kg. 
The mass density and molar density of air at STP, found above, are often useful numbers. From the molar 
density, we can easily determine another useful number, the volume of a mole of any ideal gas at STP, 
which is 22.4 L. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Liquids and solids have densities on the order of 1000 times greater 
than gases. Explain how this implies that the distances between molecules in gases are on the order of 
10 times greater than the size of their molecules. 


Solution: 


Density is mass per unit volume, and volume is proportional to the size of a body (such as the radius 
of a sphere) cubed. So if the distance between molecules increases by a factor of 10, then the volume 
occupied increases by a factor of 1000, and the density decreases by a factor of 1000. Since we 
assume molecules are in contact in liquids and solids, the distance between their centers is on the 
order of their typical size, so the distance in gases is on the order of 10 times as great. 


The ideal gas law is closely related to energy: The units on both sides of the equation are joules. The right- 
hand side of the ideal gas law equation is NkpT’. This term is roughly the total translational kinetic energy 
(which, when discussing gases, refers to the energy of translation of a molecule, not that of vibration of its 
atoms or rotation) of N molecules at an absolute temperature T, as we will see formally in the next section. 
The left-hand side of the ideal gas law equation is pV. As mentioned in the example on the number of 
molecules in an ideal gas, pressure multiplied by volume has units of energy. The energy of a gas can be 
changed when the gas does work as it increases in volume, something we explored in the preceding chapter, 
and the amount of work is related to the pressure. This is the process that occurs in gasoline or steam 
engines and turbines, as we’l see in the next chapter. 


Note: 

Problem-Solving Strategy: The Ideal Gas Law 

Step 1. Examine the situation to determine that an ideal gas is involved. Most gases are nearly ideal unless 
they are close to the boiling point or at pressures far above atmospheric pressure. 

Step 2. Make a list of what quantities are given or can be inferred from the problem as stated (identify the 
known quantities). 

Step 3. Identify exactly what needs to be determined in the problem (identify the unknown quantities). A 
written list is useful. 

Step 4. Determine whether the number of molecules or the number of moles is known or asked for to 
decide whether to use the ideal gas law as pV = NkgT, where N is the number of molecules, or 

pV = nRT, where n is the number of moles. 

Step 5. Convert known values into proper SI units (K for temperature, Pa for pressure, m® for volume, 
molecules for N, and moles for n). If the units of the knowns are consistent with one of the non-SI values 
of R, you can leave them in those units. Be sure to use absolute temperature and absolute pressure. 

Step 6. Solve the ideal gas law for the quantity to be determined (the unknown quantity). You may need to 
take a ratio of final states to initial states to eliminate the unknown quantities that are kept fixed. 

Step 7. Substitute the known quantities, along with their units, into the appropriate equation and obtain 
numerical solutions complete with units. 

Step 8. Check the answer to see if it is reasonable: Does it make sense? 


The Van der Waals Equation of State 


We have repeatedly noted that the ideal gas law is an approximation. How can it be improved upon? The 
van der Waals equation of state (named after the Dutch physicist Johannes van der Waals, 1837-1923) 
improves it by taking into account two factors. First, the attractive forces between molecules, which are 
stronger at higher density and reduce the pressure, are taken into account by adding to the pressure a term 
equal to the square of the molar density multiplied by a positive coefficient a. Second, the volume of the 
molecules is represented by a positive constant b, which can be thought of as the volume of a mole of 
molecules. This is subtracted from the total volume to give the remaining volume that the molecules can 
move in. The constants a and b are determined experimentally for each gas. The resulting equation is 


Note: 
Equation: 


p 4. o(F) | (V — nb) = nRT. 


In the limit of low density (small n), the a and b terms are negligible, and we have the ideal gas law, as we 
should for low density. On the other hand, if V — nb is small, meaning that the molecules are very close 
together, the pressure must be higher to give the same nRT, as we would expect in the situation of a highly 
compressed gas. However, the increase in pressure is less than that argument would suggest, because at 
high density the (n/ vy term is significant. Since it’s positive, it causes a lower pressure to give the same 
nRT, 


The van der Waals equation of state works well for most gases under a wide variety of conditions. As we’ ll 
see in the next module, it even predicts the gas-liquid transition. 


pV Diagrams 


We can examine aspects of the behavior of a substance by plotting a pV diagram, which is a graph of 
pressure versus volume. When the substance behaves like an ideal gas, the ideal gas law pV = nRT 
describes the relationship between its pressure and volume. On a pV diagram, it’s common to plot an 
isotherm, which is a curve showing p as a function of V with the number of molecules and the temperature 
fixed. Then, for an ideal gas, pV = constant. For example, the volume of the gas decreases as the pressure 
increases. The resulting graph is a hyperbola. 


However, if we assume the van der Waals equation of state, the isotherms become more interesting, as 
shown in [link]. At high temperatures, the curves are approximately hyperbolas, representing approximately 
ideal behavior at various fixed temperatures. At lower temperatures, the curves look less and less like 
hyperbolas—that is, the gas is not behaving ideally. There is a critical temperature J, at which the curve 
has a point with zero slope. Below that temperature, the curves do not decrease monotonically; instead, they 
each have a “hump,” meaning that for a certain range of volume, increasing the volume increases the 
pressure. 


Pressure, p 


Volume, V 


pV diagram for a Van der Waals gas at various temperatures. The red curves are 

calculated at temperatures above the critical temperature and the blue curves at 

temperatures below it. The blue curves have an oscillation in which volume (V) 

increases with increasing temperature (7), an impossible situation, so they must 
be corrected as in [link]. (credit: “Eman”/Wikimedia Commons) 


Such behavior would be completely unphysical. Instead, the curves are understood as describing a liquid- 
gas phase transition. The oscillating part of the curve is replaced by a horizontal line, showing that as the 
volume increases at constant temperature, the pressure stays constant. That behavior corresponds to boiling 
and condensation; when a substance is at its boiling temperature for a particular pressure, it can increase in 
volume as some of the liquid turns to gas, or decrease as some of the gas turns to liquid, without any change 


in temperature or pressure. 


[link] shows similar isotherms that are more realistic than those based on the van der Waals equation. The 
steep parts of the curves to the left of the transition region show the liquid phase, which is almost 
incompressible—a slight decrease in volume requires a large increase in pressure. The flat parts show the 
liquid-gas transition; the blue regions that they define represent combinations of pressure and volume where 
liquid and gas can coexist. 
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pV diagrams. (a) Each curve (isotherm) represents the relationship between p and V at a fixed 
temperature; the upper curves are at higher temperatures. The lower curves are not hyperbolas because 
the gas is no longer an ideal gas. (b) An expanded portion of the pV diagram for low temperatures, 
where the phase can change from a gas to a liquid. The term “vapor” refers to the gas phase when it 
exists at a temperature below the boiling temperature. 


The isotherms above T, do not go through the liquid-gas transition. Therefore, liquid cannot exist above 
that temperature, which is the critical temperature (described in the chapter on temperature and heat). At 
sufficiently low pressure above that temperature, the gas has the density of a liquid but will not condense; 
the gas is said to be supercritical. At higher pressure, it is solid. Carbon dioxide, for example, has no liquid 
phase at a temperature above 31.0 °C. The critical pressure is the maximum pressure at which the liquid can 
exist. The point on the pV diagram at the critical pressure and temperature is the critical point (which you 
learned about in the chapter on temperature and heat). [link] lists representative critical temperatures and 


pressures. 


Substance 


Water 


Sulfur dioxide 


Ammonia 


Carbon dioxide 


Critical temperature 


°C 


374.3 


157.6 


132.4 


31.1 


Critical pressure 


Pa atm 

22.12 x 10° 219.0 
7.88 x 10° 78.0 
11.28 x 10° 111.7 
7.39 x 10° 73.2 


Substance Critical temperature Critical pressure 


Oxygen 154.8 -118.4 5.08 x 10° 50.3 
Nitrogen 126.2 -146.9 3.39 x 10° 33.6 
Hydrogen 33.3 ~239.9 1.30 x 10° 12.9 
Helium 5.3 ~267.9 0.229 x 10° 227 


Critical Temperatures and Pressures for Various Substances 


Summary 


The ideal gas law relates the pressure and volume of a gas to the number of gas molecules and the 
temperature of the gas. 

A mole of any substance has a number of molecules equal to the number of atoms in a 12-g sample of 
carbon-12. The number of molecules in a mole is called Avogadro’s number Na, 

Equation: 


Na = 6.02 x 10% mol}. 


A mole of any substance has a mass in grams numerically equal to its molecular mass in unified mass 
units, which can be determined from the periodic table of elements. The ideal gas law can also be 
written and solved in terms of the number of moles of gas: 

Equation: 


pV =nRT, 


where n is the number of moles and R is the universal gas constant, 
Equation: 


R = 8.313/mol-K. 


The ideal gas law is generally valid at temperatures well above the boiling temperature. 

The van der Waals equation of state for gases is valid closer to the boiling point than the ideal gas law. 
Above the critical temperature and pressure for a given substance, the liquid phase does not exist, and 
the sample is “supercritical.” 


Conceptual Questions 


Exercise: 


Problem: 


Two Hp» molecules can react with one O2 molecule to produce two H2,O molecules. How many moles 
of hydrogen molecules are needed to react with one mole of oxygen molecules? 


Solution: 


2 moles, as that will contain twice as many molecules as the 1 mole of oxygen 


Exercise: 
Problem: 
Under what circumstances would you expect a gas to behave significantly differently than predicted by 
the ideal gas law? 
Exercise: 
Problem: 


A constant-volume gas thermometer contains a fixed amount of gas. What property of the gas is 
measured to indicate its temperature? 


Solution: 


pressure 
Exercise: 
Problem: 
Inflate a balloon at room temperature. Leave the inflated balloon in the refrigerator overnight. What 
happens to the balloon, and why? 
Exercise: 
Problem: 


In the last chapter, free convection was explained as the result of buoyant forces on hot fluids. Explain 
the upward motion of air in flames based on the ideal gas law. 


Solution: 


The flame contains hot gas (heated by combustion). The pressure is still atmospheric pressure, in 
mechanical equilibrium with the air around it (or roughly so). The density of the hot gas is 
proportional to its number density N/V (neglecting the difference in composition between the gas in the 
flame and the surrounding air). At higher temperature than the surrounding air, the ideal gas law says 
that N/V = p/kpgT is less than that of the surrounding air. Therefore the hot air has lower density 
than the surrounding air and is lifted by the buoyant force. 


Problems 


Exercise: 
Problem: 
The gauge pressure in your car tires is 2.50 x 10°N/ m?’ ata temperature of 35.0 °C when you drive 


it onto a ship in Los Angeles to be sent to Alaska. What is their gauge pressure on a night in Alaska 
when their temperature has dropped to —40.0 °C ? Assume the tires have not gained or lost any air. 


Exercise: 


Problem: 


Suppose a gas-filled incandescent light bulb is manufactured so that the gas inside the bulb is at 
atmospheric pressure when the bulb has a temperature of 20.0 °C. (a) Find the gauge pressure inside 
such a bulb when it is hot, assuming its average temperature is 60.0 °C (an approximation) and 
neglecting any change in volume due to thermal expansion or gas leaks. (b) The actual final pressure 
for the light bulb will be less than calculated in part (a) because the glass bulb will expand. Is this 
effect significant? 


Solution: 


a. 0.137 atm; b. p, = (1 atm) an — 1atm. Because of the expansion of the glass, V2 = 0.99973. 


Multiplying by that factor does not make any significant difference. 
Exercise: 
Problem: 
People buying food in sealed bags at high elevations often notice that the bags are puffed up because 
the air inside has expanded. A bag of pretzels was packed at a pressure of 1.00 atm and a temperature 


of 22.0 “C. When opened at a summer picnic in Santa Fe, New Mexico, at a temperature of 32.0 °C, 
the volume of the air in the bag is 1.38 times its original volume. What is the pressure of the air? 


Exercise: 
Problem: 


How many moles are there in (a) 0.0500 g of Nz gas (M = 28.0 g/mol)? (b) 10.0 g of COz gas 
(M = 44.0 g/mol)? (c) How many molecules are present in each case? 


Solution: 
a.1.79 x 10° mol; b. 0.227 mol; c. 1.08 x 107! molecules for the nitrogen, 1.37 x 1078 
molecules for the carbon dioxide 
Exercise: 
Problem: 
A cubic container of volume 2.00 L holds 0.500 mol of nitrogen gas at a temperature of 25.0 °C. 


What is the net force due to the nitrogen on one wall of the container? Compare that force to the 
sample’s weight. 


Exercise: 


Problem: 


Calculate the number of moles in the 2.00-L volume of air in the lungs of the average person. Note 
that the air is at 37.0 °C (body temperature) and that the total volume in the lungs is several times the 
amount inhaled in a typical breath as given in [link]. 


Solution: 


7.84 x 10°? mol 


Exercise: 


Problem: 


An airplane passenger has 100 cm’ of air in his stomach just before the plane takes off from a sea- 
level airport. What volume will the air have at cruising altitude if cabin pressure drops to 
7.50 x 104N/m?? 

Exercise: 
Problem: 
A company advertises that it delivers helium at a gauge pressure of 1.72 x 10’ Pa ina cylinder of 
volume 43.8 L. How many balloons can be inflated to a volume of 4.00 L with that amount of helium? 


Assume the pressure inside the balloons is 1.01 x 10° Pa and the temperature in the cylinder and the 
balloons is 25.0 °C. 


Solution: 


1.87 x 10° 
Exercise: 
Problem: 
According to http://hyperphysics.phy-astr.gsu.edu/hbase/solar/venusenv.html, the atmosphere of Venus 


is approximately 96.5% COz and 3.5% Ne by volume. On the surface, where the temperature is about 
750 K and the pressure is about 90 atm, what is the density of the atmosphere? 


Exercise: 
Problem: 


An expensive vacuum system can achieve a pressure as low as 1.00 x 10-’N/ m? at 20.0 °C. How 
many molecules are there in a cubic centimeter at this pressure and temperature? 


Solution: 


2.47 x 10’ molecules 
Exercise: 
Problem: 


The number density N/V of gas molecules at a certain location in the space above our planet is about 


1.00 x 101! m~%, and the pressure is 2.75 x 10~1° N/m’ in this space. What is the temperature 
there? 


Exercise: 
Problem: 
A bicycle tire contains 2.00 L of gas at an absolute pressure of 7.00 x 10°N / m? anda temperature 
of 18.0 °C. What will its pressure be if you let out an amount of air that has a volume of 100 cm? at 
atmospheric pressure? Assume tire temperature and volume remain constant. 


Solution: 


6.95 x 10° Pa; 6.86 atm 


Exercise: 
Problem: 
In acommon demonstration, a bottle is heated and stoppered with a hard-boiled egg that’s a little 
bigger than the bottle’s neck. When the bottle is cooled, the pressure difference between inside and 
outside forces the egg into the bottle. Suppose the bottle has a volume of 0.500 L and the temperature 
inside it is raised to 80.0 °C while the pressure remains constant at 1.00 atm because the bottle is 
open. (a) How many moles of air are inside? (b) Now the egg is put in place, sealing the bottle. What 


is the gauge pressure inside after the air cools back to the ambient temperature of 25 °C but before the 
egg is forced into the bottle? 


Exercise: 


Problem: 


A high-pressure gas cylinder contains 50.0 L of toxic gas at a pressure of 1.40 x 10’N/ m? anda 
temperature of 25.0 °C. The cylinder is cooled to dry ice temperature (—78.5 °C) to reduce the leak 
rate and pressure so that it can be safely repaired. (a) What is the final pressure in the tank, assuming a 
negligible amount of gas leaks while being cooled and that there is no phase change? (b) What is the 
final pressure if one-tenth of the gas escapes? (c) To what temperature must the tank be cooled to 
reduce the pressure to 1.00 atm (assuming the gas does not change phase and that there is no leakage 
during cooling)? (d) Does cooling the tank as in part (c) appear to be a practical solution? 


Solution: 


a.9.14 x 10° Pa; b. 8.22 x 10° Pa; c. 2.15 K; d. no 
Exercise: 


Problem: 


Find the number of moles in 2.00 L of gas at 35.0 °C and under 7.41 x 10’ N/m of pressure. 
Exercise: 


Problem: 


Calculate the depth to which Avogadro’s number of table tennis balls would cover Earth. Each ball has 
a diameter of 3.75 cm. Assume the space between balls adds an extra 25.0% to their volume and 
assume they are not crushed by their own weight. 


Solution: 


40.7 km 
Exercise: 


Problem: 

(a) What is the gauge pressure in a 25.0 °C car tire containing 3.60 mol of gas in a 30.0-L volume? (b) 
What will its gauge pressure be if you add 1.00 L of gas originally at atmospheric pressure and 

25.0 “C ? Assume the temperature remains at 25.0 °C and the volume remains constant. 


Glossary 


Avogadro’s number 


Na, the number of molecules in one mole of a substance; Na = 6.02 x 1078 particles/mole 


Boltzmann constant 
kg, a physical constant that relates energy to temperature and appears in the ideal gas law; 
kg = 1.38 x 10°73 J/K 


critical temperature 
T,, at which the isotherm has a point with zero slope 


ideal gas 
gas at the limit of low density and high temperature 


ideal gas law 
physical law that relates the pressure and volume of a gas, far from liquefaction, to the number of gas 
molecules or number of moles of gas and the temperature of the gas 


mole 
quantity of a substance whose mass (in grams) is equal to its molecular mass 


pV diagram 
graph of pressure vs. volume 


supercritical 
condition of a fluid being at such a high temperature and pressure that the liquid phase cannot exist 


universal gas constant 
R, the constant that appears in the ideal gas law expressed in terms of moles, given by R = Nykp 


van der Waals equation of state 
equation, typically approximate, which relates the pressure and volume of a gas to the number of gas 
molecules or number of moles of gas and the temperature of the gas 


Pressure, Temperature, and RMS Speed 
By the end of this section, you will be able to: 


e Explain the relations between microscopic and macroscopic quantities in a gas 
e Solve problems involving mixtures of gases 
e Solve problems involving the distance and time between a gas molecule’s collisions 


We have examined pressure and temperature based on their macroscopic definitions. Pressure is the force 
divided by the area on which the force is exerted, and temperature is measured with a thermometer. We 
can gain a better understanding of pressure and temperature from the kinetic theory of gases, the theory 
that relates the macroscopic properties of gases to the motion of the molecules they consist of. First, we 
make two assumptions about molecules in an ideal gas. 


1. There is a very large number N of molecules, all identical and each having mass m. 
2. The molecules obey Newton’s laws and are in continuous motion, which is random and isotropic, 
that is, the same in all directions. 


To derive the ideal gas law and the connection between microscopic quantities such as the energy of a 
typical molecule and macroscopic quantities such as temperature, we analyze a sample of an ideal gas in a 
rigid container, about which we make two further assumptions: 


3. The molecules are much smaller than the average distance between them, so their total volume is 
much less than that of their container (which has volume V). In other words, we take the Van der 
Waals constant b, the volume of a mole of gas molecules, to be negligible compared to the volume of 
a mole of gas in the container. 

4. The molecules make perfectly elastic collisions with the walls of the container and with each other. 
Other forces on them, including gravity and the attractions represented by the Van der Waals 
constant a, are negligible (as is necessary for the assumption of isotropy). 


The collisions between molecules do not appear in the derivation of the ideal gas law. They do not disturb 
the derivation either, since collisions between molecules moving with random velocities give new random 
velocities. Furthermore, if the velocities of gas molecules in a container are initially not random and 
isotropic, molecular collisions are what make them random and isotropic. 


We make still further assumptions that simplify the calculations but do not affect the result. First, we let 
the container be a rectangular box. Second, we begin by considering monatomic gases, those whose 
molecules consist of single atoms, such as helium. Then, we can assume that the atoms have no energy 
except their translational kinetic energy; for instance, they have neither rotational nor vibrational energy. 
(Later, we discuss the validity of this assumption for real monatomic gases and dispense with it to 
consider diatomic and polyatomic gases.) 


[link] shows a collision of a gas molecule with the wall of a container, so that it exerts a force on the wall 
(by Newton’s third law). These collisions are the source of pressure in a gas. As the number of molecules 
increases, the number of collisions, and thus the pressure, increases. Similarly, if the average velocity of 
the molecules is higher, the gas pressure is higher. 


When a molecule collides with a 
rigid wall, the component of its 
momentum perpendicular to the 
wall is reversed. A force is thus 
exerted on the wall, creating 


pressure. 


In a sample of gas in a container, the randomness of the molecular motion causes the number of collisions 
of molecules with any part of the wall in a given time to fluctuate. However, because a huge number of 
molecules collide with the wall in a short time, the number of collisions on the scales of time and space 
we measure fluctuates by only a tiny, usually unobservable fraction from the average. We can compare 
this situation to that of a casino, where the outcomes of the bets are random and the casino’s takings 
fluctuate by the minute and the hour. However, over long times such as a year, the casino’s takings are 
very close to the averages expected from the odds. A tank of gas has enormously more molecules than a 
casino has bettors in a year, and the molecules make enormously more collisions in a second than a casino 
has bets. 


A calculation of the average force exerted by molecules on the walls of the box leads us to the ideal gas 
law and to the connection between temperature and molecular kinetic energy. (In fact, we will take two 
averages: one over time to get the average force exerted by one molecule with a given velocity, and then 
another average over molecules with different velocities.) This approach was developed by Daniel 
Bernoulli (1700-1782), who is best known in physics for his work on fluid flow (hydrodynamics). 
Remarkably, Bernoulli did this work before Dalton established the view of matter as consisting of atoms. 


[link] shows a container full of gas and an expanded view of an elastic collision of a gas molecule with a 
wall of the container, broken down into components. We have assumed that a molecule is small compared 
with the separation of molecules in the gas, and that its interaction with other molecules can be ignored. 


Under these conditions, the ideal gas law is experimentally valid. Because we have also assumed the wall 
is rigid and the particles are points, the collision is elastic (by conservation of energy—there’s nowhere 
for a particle’s kinetic energy to go). Therefore, the molecule’s kinetic energy remains constant, and 
hence, its speed and the magnitude of its momentum remain constant as well. This assumption is not 
always valid, but the results in the rest of this module are also obtained in models that let the molecules 
exchange energy and momentum with the wall. 


Gas in a box exerts an outward pressure on its 
walls. A molecule colliding with a rigid wall 
has its velocity and momentum in the x- 
direction reversed. This direction is 
perpendicular to the wall. The components of 
its velocity momentum in the y- and z- 
directions are not changed, which means 
there is no force parallel to the wall. 


If the molecule’s velocity changes in the x-direction, its momentum changes from —mv, to +mv,. Thus, 
its change in momentum is Amv = +mvz — (—mvz) = 2mv,. According to the impulse-momentum 
theorem given in the chapter on linear momentum and collisions, the force exerted on the ith molecule, 
where i labels the molecules from 1 to N, is given by 

Equation: 


— Ap, — AmMviz 


= he > he 


(in this equation alone, p represents momentum, not pressure.) There is no force between the wall and the 
molecule except while the molecule is touching the wall. During the short time of the collision, the force 
between the molecule and wall is relatively large, but that is not the force we are looking for. We are 
looking for the average force, so we take At to be the average time between collisions of the given 
molecule with this wall, which is the time in which we expect to find one collision. Let | represent the 
length of the box in the x-direction. Then At is the time the molecule would take to go across the box and 
back, a distance 21, at a speed of vz. Thus At = 21/vz, and the expression for the force becomes 
Equation: 


_ AMVvie M5, 


= 21/Vix = l 


a 


This force is due to one molecule. To find the total force on the wall, F, we need to add the contributions 
of all N molecules: 
Equation: 


We now use the definition of the average, which we denote with a bar, to find the force: 
Equation: 


We want the force in terms of the speed v, rather than the x-component of the velocity. Note that the total 
velocity squared is the sum of the squares of its components, so that 
Equation: 


2 — y2 24 42 
= Ua we 


With the assumption of isotropy, the three averages on the right side are equal, so 
Equation: 


= 43 
v* = 3vj,. 


Substituting this into the expression for F gives 
Equation: 


The pressure is F/A, so we obtain 
Equation: 


F 
= = N 
P A 


where we used V = AI for the volume. This gives the important result 


Note: 
Equation: 


pV = — Nmv’?. 


Combining this equation with pV = NkgT gives 
Equation: 


1 = 
3 Nmv? = NkpT. 


We can get the average kinetic energy of a molecule, $ mv?, from the left-hand side of the equation by 
dividing out N and multiplying by 3/2. 


Note: 
Average Kinetic Energy per Molecule 
The average kinetic energy of a molecule is directly proportional to its absolute temperature: 
Equation: 
1 a 


= 3 
K = = mv? = — kpT. 
2 2 


The equation K = 3 


this equation depends on the molecular mass (or any other property) of the gas, the pressure, or anything 
but the temperature. If samples of helium and xenon gas, with very different molecular masses, are at the 
same temperature, the molecules have the same average kinetic energy. 


kpT is the average kinetic energy per molecule. Note in particular that nothing in 


The internal energy of a thermodynamic system is the sum of the mechanical energies of all of the 
molecules in it. We can now give an equation for the internal energy of a monatomic ideal gas. In such a 
gas, the molecules’ only energy is their translational kinetic energy. Therefore, denoting the internal 


energy by Hint, we simply have Fing = NK, or 


Note: 


Equation: 


3 
Bi = 2 NkpT. 


Often we would like to use this equation in terms of moles: 
Equation: 


3 
Fint = 2 nRT. 


We can solve K = ~ mv? = - kpT for a typical speed of a molecule in an ideal gas in terms of 


temperature to determine what is known as the root-mean-square (rms) speed of a molecule. 


Note: 
RMS Speed of a Molecule 
The root-mean-square (rms) speed of a molecule, or the square root of the average of the square of the 


speedv?, is 
[= [3keT 
Urms = Ve = . 
m 


Equation: 


The rms speed is not the average or the most likely speed of molecules, as we will see in Distribution of 
Molecular Speeds, but it provides an easily calculated estimate of the molecules’ speed that is related to 


their kinetic energy. Again we can write this equation in terms of the gas constant R and the molar mass 
Min kg/mol: 


Note: 
Equation: 


We digress for a moment to answer a question that may have occurred to you: When we apply the model 
to atoms instead of theoretical point particles, does rotational kinetic energy change our results? To 
answer this question, we have to appeal to quantum mechanics. In quantum mechanics, rotational kinetic 
energy cannot take on just any value; it’s limited to a discrete set of values, and the smallest value is 
inversely proportional to the rotational inertia. The rotational inertia of an atom is tiny because almost all 


of its mass is in the nucleus, which typically has a radius less than 10‘ m. Thus the minimum rotational 
energy of an atom is much more than + kpT for any attainable temperature, and the energy available is 


not enough to make an atom rotate. We will return to this point when discussing diatomic and polyatomic 
gases in the next section. 


Example: 

Calculating Kinetic Energy and Speed of a Gas Molecule 

(a) What is the average kinetic energy of a gas molecule at 20.0 °C (room temperature)? (b) Find the rms 
speed of a nitrogen molecule (N2) at this temperature. 


Strategy 
(a) The known in the equation for the average kinetic energy is the temperature: 
Equation: 
= los 3 
K = — mv? = —kpT. 
2 2 


Before substituting values into this equation, we must convert the given temperature into kelvin: 
T = (20.0 + 273) K = 293 K. We can find the rms speed of a nitrogen molecule by using the equation 


Equation: 
is | 3kpT 
Urms = Us — are) 
m 


but we must first find the mass of a nitrogen molecule. Obtaining the molar mass of nitrogen Nz from the 
periodic table, we find 
Equation: 

M _ 2(14.0067) x 10~* kg/mol) 


————— = 4.65 x 10° kg. 
Na 6.02 x 1073 molt 


Solution 


a. The temperature alone is sufficient for us to find the average translational kinetic energy. 
Substituting the temperature into the translational kinetic energy equation gives 


Equation: 
Ie 3 3 =98 =i 
= A= 5 (138 SOLO Sy (2931 6070 aod 
b. Substituting this mass and the value for kg into the equation for vpms yields 
Equation: 
3kpT 3(1.38 x 10°73 J/K)(293 K 
Urms = / 2 = eS ees) => 511 m/s. 
m aly 3< II)" lige 
Significance 


Note that the average kinetic energy of the molecule is independent of the type of molecule. The average 
translational kinetic energy depends only on absolute temperature. The kinetic energy is very small 
compared to macroscopic energies, so that we do not feel when an air molecule is hitting our skin. On the 


other hand, it is much greater than the typical difference in gravitational potential energy when a 
molecule moves from, say, the top to the bottom of a room, so our neglect of gravitation is justified in 
typical real-world situations. The rms speed of the nitrogen molecule is surprisingly large. These large 
molecular velocities do not yield macroscopic movement of air, since the molecules move in all 
directions with equal likelihood. The mean free path (the distance a molecule moves on average between 
collisions, discussed a bit later in this section) of molecules in air is very small, so the molecules move 
rapidly but do not get very far in a second. The high value for rms speed is reflected in the speed of 
sound, which is about 340 m/s at room temperature. The higher the rms speed of air molecules, the faster 
sound vibrations can be transferred through the air. The speed of sound increases with temperature and is 
greater in gases with small molecular masses, such as helium (see [link]). 

Wave front of sound 
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(a) In an ordinary gas, so many molecules move so fast that they collide 
billions of times every second. (b) Individual molecules do not move very 
far in a small amount of time, but disturbances like sound waves are 
transmitted at speeds related to the molecular speeds. 


Example: 

Calculating Temperature: Escape Velocity of Helium Atoms 

To escape Earth’s gravity, an object near the top of the atmosphere (at an altitude of 100 km) must travel 
away from Earth at 11.1 km/s. This speed is called the escape velocity. At what temperature would 
helium atoms have an rms speed equal to the escape velocity? 

Strategy 

Identify the knowns and unknowns and determine which equations to use to solve the problem. 
Solution 


1. Identify the knowns: v is the escape velocity, 11.1 km/s. 

2. Identify the unknowns: We need to solve for temperature, T. We also need to solve for the mass m 
of the helium atom. 

3. Determine which equations are needed. 


o To get the mass m of the helium atom, we can use information from the periodic table: 


Equation: 
M 
— 
Na 
° To solve for temperature T, we can rearrange 
Equation: 
lige 3 
— mv? = — kpT 
2 ge 
to yield 
Equation: 
_ mv? 
 3kp 


4. Substitute the known values into the equations and solve for the unknowns, 


Equation: 
M 4.0026 x 10°-°k 1 
ee ee 10-2” ke 
Na 6.02 x 107° mol 
and 
Equation: 
6.65 x 10-2’ kg) (11.1 x 102?m/s)’ 
r-! 8) ( /s) =1.98 x 10*K. 
B(s8 x 10677) kK) 
Significance 


This temperature is much higher than atmospheric temperature, which is approximately 250 K 

(—25 °C or — 10 °F) at high elevation. Very few helium atoms are left in the atmosphere, but many 
were present when the atmosphere was formed, and more are always being created by radioactive decay 
(see the chapter on nuclear physics). The reason for the loss of helium atoms is that a small number of 
helium atoms have speeds higher than Earth’s escape velocity even at normal temperatures. The speed of 
a helium atom changes from one collision to the next, so that at any instant, there is a small but nonzero 
chance that the atom’s speed is greater than the escape velocity. The chance is high enough that over the 
lifetime of Earth, almost all the helium atoms that have been in the atmosphere have reached escape 
velocity at high altitudes and escaped from Earth’s gravitational pull. Heavier molecules, such as oxygen, 
nitrogen, and water, have smaller rms speeds, and so it is much less likely that any of them will have 
speeds greater than the escape velocity. In fact, the likelihood is so small that billions of years are 
required to lose significant amounts of heavier molecules from the atmosphere. [link] shows the effect of 
a lack of an atmosphere on the Moon. Because the gravitational pull of the Moon is much weaker, it has 
lost almost its entire atmosphere. The atmospheres of Earth and other bodies are compared in this 
chapter’s exercises. 


This photograph of Apollo 17 Commander Eugene 
Cernan driving the lunar rover on the Moon in 1972 
looks as though it was taken at night with a large 
spotlight. In fact, the light is coming from the Sun. 
Because the acceleration due to gravity on the Moon is 
so low (about 1/6 that of Earth), the Moon’s escape 
velocity is much smaller. As a result, gas molecules 
escape very easily from the Moon, leaving it with 
virtually no atmosphere. Even during the daytime, the 
sky is black because there is no gas to scatter sunlight. 
(credit: Harrison H. Schmitt/NASA) 


Note: 
Exercise: 


Problem: 
Check Your Understanding If you consider a very small object, such as a grain of pollen, in a gas, 
then the number of molecules striking its surface would also be relatively small. Would you expect 


the grain of pollen to experience any fluctuations in pressure due to statistical fluctuations in the 
number of gas molecules striking it in a given amount of time? 


Solution: 


Yes. Such fluctuations actually occur for a body of any size in a gas, but since the numbers of 
molecules are immense for macroscopic bodies, the fluctuations are a tiny percentage of the number 


of collisions, and the averages spoken of in this section vary imperceptibly. Roughly speaking, the 
fluctuations are inversely proportional to the square root of the number of collisions, so for small 
bodies, they can become significant. This was actually observed in the nineteenth century for pollen 
grains in water and is known as Brownian motion. 


Vapor Pressure, Partial Pressure, and Dalton’s Law 


The pressure a gas would create if it occupied the total volume available is called the gas’s partial 
pressure. If two or more gases are mixed, they will come to thermal equilibrium as a result of collisions 
between molecules; the process is analogous to heat conduction as described in the chapter on 
temperature and heat. As we have seen from kinetic theory, when the gases have the same temperature, 
their molecules have the same average kinetic energy. Thus, each gas obeys the ideal gas law separately 
and exerts the same pressure on the walls of a container that it would if it were alone. Therefore, in a 
mixture of gases, the total pressure is the sum of partial pressures of the component gases, assuming ideal 
gas behavior and no chemical reactions between the components. This law is known as Dalton’s law of 
partial pressures, after the English scientist John Dalton (1766-1844) who proposed it. Dalton’s law is 
consistent with the fact that pressures add according to Pascal’s principle. 


In a mixture of ideal gases in thermal equilibrium, the number of molecules of each gas is proportional to 
its partial pressure. This result follows from applying the ideal gas law to each in the form p/n = RT/V. 
Because the right-hand side is the same for any gas at a given temperature in a container of a given 
volume, the left-hand side is the same as well. 


e Partial pressure is the pressure a gas would create if it existed alone. 
e Dalton’s law states that the total pressure is the sum of the partial pressures of all of the gases 
present. 

e For any two gases (labeled 1 and 2) in equilibrium in a container, a = a 
An important application of partial pressure is that, in chemistry, it functions as the concentration of a gas 
in determining the rate of a reaction. Here, we mention only that the partial pressure of oxygen ina 
person’s lungs is crucial to life and health. Breathing air that has a partial pressure of oxygen below 0.16 
atm can impair coordination and judgment, particularly in people not acclimated to a high elevation. 
Lower partial pressures of Oz have more serious effects; partial pressures below 0.06 atm can be quickly 
fatal, and permanent damage is likely even if the person is rescued. However, the sensation of needing to 
breathe, as when holding one’s breath, is caused much more by high concentrations of carbon dioxide in 
the blood than by low concentrations of oxygen. Thus, if a small room or closet is filled with air having a 
low concentration of oxygen, perhaps because a leaking cylinder of some compressed gas is stored there, 
a person will not feel any “choking” sensation and may go into convulsions or lose consciousness without 
noticing anything wrong. Safety engineers give considerable attention to this danger. 


Another important application of partial pressure is vapor pressure, which is the partial pressure of a 
vapor at which it is in equilibrium with the liquid (or solid, in the case of sublimation) phase of the same 
substance. At any temperature, the partial pressure of the water in the air cannot exceed the vapor 
pressure of the water at that temperature, because whenever the partial pressure reaches the vapor 
pressure, water condenses out of the air. Dew is an example of this condensation. The temperature at 
which condensation occurs for a sample of air is called the dew point. It is easily measured by slowly 
cooling a metal ball; the dew point is the temperature at which condensation first appears on the ball. 


The vapor pressures of water at some temperatures of interest for meteorology are given in [Link]. 


T(‘C) Vapor Pressure (Pa) 


0 610.5 

3 757.9 

5 872.3 

8 1073 
10 1228 
13 1497 
15 1705 
18 2063 
20 2338 
23 2809 
25 3167 
30 4243 
35 5623 
AO 7376 


Vapor Pressure of Water at Various Temperatures 


The relative humidity (R.H.) at a temperature T is defined by 
Equation: 


_ Partial pressure of water vapor at T 


R.H. x 100%. 


Vapor pressure of water at T 


A relative humidity of 100% means that the partial pressure of water is equal to the vapor pressure; in 
other words, the air is saturated with water. 


Example: 

Calculating Relative Humidity 

What is the relative humidity when the air temperature is 25 °C and the dew point is 15 °C? 
Strategy 

We simply look up the vapor pressure at the given temperature and that at the dew point and find the 
ratio. 

Solution 

Equation: 


_ Partial pressure of water vapor at 15 °C 1705 Pa 


ca 00 
H UU rer as 


x 100% = 53.8%. 


Partial pressure of water vapor at 25 °C 


Significance 

R.H. is important to our comfort. The value of 53.8 % is within the range of 40 % to 60 % recommended 
for comfort indoors. 

As noted in the chapter on temperature and heat, the temperature seldom falls below the dew point, 
because when it reaches the dew point or frost point, water condenses and releases a relatively large 
amount of latent heat of vaporization. 


Mean Free Path and Mean Free Time 


We now consider collisions explicitly. The usual first step (which is all we’ll take) is to calculate the 
mean free path, 4, the average distance a molecule travels between collisions with other molecules, and 
the mean free time T, the average time between the collisions of a molecule. If we assume all the 
molecules are spheres with a radius r, then a molecule will collide with another if their centers are within 
a distance 2r of each other. For a given particle, we say that the area of a circle with that radius, 47r?, is 
the “cross-section” for collisions. As the particle moves, it traces a cylinder with that cross-sectional area. 
The mean free path is the length A such that the expected number of other molecules in a cylinder of 
length \ and cross-section 47rr? is 1. If we temporarily ignore the motion of the molecules other than the 
one we’re looking at, the expected number is the number density of molecules, N/V, times the volume, 
and the volume is 47r?A, so we have (N/V)4rr? = 1, or 

Equation: 


ieee. 
Anr2N 


Taking the motion of all the molecules into account makes the calculation much harder, but the only 
change is a factor of V2. The result is 


Note: 
Equation: 


ey: 
4V/2nr2N 


In an ideal gas, we can substitute V/N = kpT’/p to obtain 


Note: 
Equation: 


The mean free time 7 is simply the mean free path divided by a typical speed, and the usual choice is the 
rms speed. Then 


Note: 
Equation: 
kpT 
oe 
Ay/2rr2pUrms 
Example: 


Calculating Mean Free Time 

Find the mean free time for argon atoms (IM = 39.9 g/mol) at a temperature of 0 °C and a pressure of 
1.00 atm. Take the radius of an argon atom to be 1.70 x 10°71? m. 

Solution 


1. Identify the knowns and convert into SI units. We know the molar mass is 0.0399 kg/mol, the 
temperature is 273 K, the pressure is 1.01 x 10° Pa, and the radius is 1.70 x 10-1? m. 


2. Find the rms speed: Yrms = 4/ SHE = ais at 
3. Substitute into the equation for the mean free time: 


Equation: 
kpT (138-2102 IK 273 1) 


T= 6 x 107, 
AV2nr2pirms 4/2 (1.70 x 107! m)"(1.01 x 10° Pa)(413 m/s) 


Significance 
We can hardly compare this result with our intuition about gas molecules, but it gives us a picture of 
molecules colliding with extremely high frequency. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Which has a longer mean free path, liquid water or water vapor in the 
air? 


Solution: 


In a liquid, the molecules are very close together, constantly colliding with one another. For a gas to 
be nearly ideal, as air is under ordinary conditions, the molecules must be very far apart. Therefore 


the mean free path is much longer in the air. 


Summary 


e Kinetic theory is the atomic description of gases as well as liquids and solids. It models the 
properties of matter in terms of continuous random motion of molecules. 


e The ideal gas law can be expressed in terms of the mass of the gas’s molecules and v?, the average 
of the molecular speed squared, instead of the temperature. 

e The temperature of gases is proportional to the average translational kinetic energy of molecules. 
Hence, the typical speed of gas molecules vm; is proportional to the square root of the temperature 
and inversely proportional to the square root of the molecular mass. 

e Inamixture of gases, each gas exerts a pressure equal to the total pressure times the fraction of the 
mixture that the gas makes up. 

e The mean free path (the average distance between collisions) and the mean free time of gas 
molecules are proportional to the temperature and inversely proportional to the molar density and the 
molecules’ cross-sectional area. 


Conceptual Questions 


Exercise: 
Problem: 
How is momentum related to the pressure exerted by a gas? Explain on the molecular level, 
considering the behavior of molecules. 
Exercise: 
Problem: 


If one kind of molecule has double the radius of another and eight times the mass, how do their mean 
free paths under the same conditions compare? How do their mean free times compare? 


Solution: 


The mean free path is inversely proportional to the square of the radius, so it decreases by a factor of 
4. The mean free time is proportional to the mean free path and inversely proportional to the rms 
speed, which in turn is inversely proportional to the square root of the mass. That gives a factor of 


4/8 in the numerator, so the mean free time decreases by a factor of V/ 2. 


Exercise: 


Problem: What is the average velocity of the air molecules in the room where you are right now? 
Exercise: 


Problem: 


Why do the atmospheres of Jupiter, Saturn, Uranus, and Neptune, which are much more massive and 
farther from the Sun than Earth is, contain large amounts of hydrogen and helium? 


Solution: 


Since they’re more massive, their gravity is stronger, so the escape velocity from them is higher. 
Since they’re farther from the Sun, they’re colder, so the speeds of atmospheric molecules including 
hydrogen and helium are lower. The combination of those facts means that relatively few hydrogen 
and helium molecules have escaped from the outer planets. 


Exercise: 
Problem: 


Statistical mechanics says that in a gas maintained at a constant temperature through thermal contact 
with a bigger system (a “reservoir”) at that temperature, the fluctuations in internal energy are 


typically a fraction 1/ N of the internal energy. As a fraction of the total internal energy of a mole 
of gas, how big are the fluctuations in the internal energy? Are we justified in ignoring them? 


Exercise: 
Problem: 


Which is more dangerous, a closet where tanks of nitrogen are stored, or one where tanks of carbon 
dioxide are stored? 


Solution: 


One where nitrogen is stored, as excess COz2 will cause a feeling of suffocating, but excess nitrogen 
and insufficient oxygen will not. 


Problems 

In the problems in this section, assume all gases are ideal. 

Exercise: 
Problem: 
A person hits a tennis ball with a mass of 0.058 kg against a wall. The average component of the 
ball’s velocity perpendicular to the wall is 11 m/s, and the ball hits the wall every 2.1 s on average, 
rebounding with the opposite perpendicular velocity component. (a) What is the average force 


exerted on the wall? (b) If the part of the wall the person hits has an area of 3.0 m?, what is the 
average pressure on that area? 


Solution: 


a. 0.61 N; b. 0.20 Pa 
Exercise: 
Problem: 
A person is in a closed room (a racquetball court) with V = 453 m® hitting a ball (m = 42.0 g) 
around at random without any pauses. The average kinetic energy of the ball is 2.30 J. (a) What is 


the average value of v2? Does it matter which direction you take to be x? (b) Applying the methods 
of this chapter, find the average pressure on the walls? (c) Aside from the presence of only one 


that does not apply here? 


Exercise: 


Problem: 


Five bicyclists are riding at the following speeds: 5.4 m/s, 5.7 m/s, 5.8 m/s, 6.0 m/s, and 6.5 m/s. (a) 
What is their average speed? (b) What is their rms speed? 


Solution: 


a. 5.88 m/s; b. 5.89 m/s 
Exercise: 
Problem: 
Some incandescent light bulbs are filled with argon gas. What is Urms for argon atoms near the 
filament, assuming their temperature is 2500 K? 
Exercise: 
Problem: 


Typical molecular speeds (v;:ms) are large, even at low temperatures. What is Upms for helium atoms 
at 5.00 K, less than one degree above helium’s liquefaction temperature? 


Solution: 


177 m/s 
Exercise: 
Problem: 
What is the average kinetic energy in joules of hydrogen atoms on the 5500 °C surface of the Sun? 


(b) What is the average kinetic energy of helium atoms in a region of the solar corona where the 
temperature is 6.00 x 10°K? 


Exercise: 
Problem: 
What is the ratio of the average translational kinetic energy of a nitrogen molecule at a temperature 


of 300 K to the gravitational potential energy of a nitrogen-molecule—Earth system at the ceiling of a 
3-m-tall room with respect to the same system with the molecule at the floor? 


Solution: 


4.54 x 10° 
Exercise: 
Problem: 
What is the total translational kinetic energy of the air molecules in a room of volume 23 m? if the 


pressure is 9.5 x 10* Pa (the room is at fairly high elevation) and the temperature is 21 °C ? Is any 
item of data unnecessary for the solution? 


Exercise: 


Problem: 


The product of the pressure and volume of a sample of hydrogen gas at 0.00 °C is 80.0 J. (a) How 
many moles of hydrogen are present? (b) What is the average translational kinetic energy of the 
hydrogen molecules? (c) What is the value of the product of pressure and volume at 200 °C? 


Solution: 


a. 0.0352 mol; b. 5.65 x 10°71 J;c. 139 J 
Exercise: 
Problem: 
What is the gauge pressure inside a tank of 4.86 x 104 mol of compressed nitrogen with a volume 
of 6.56 m? if the rms speed is 514 m/s? 
Exercise: 
Problem: 
If the rms speed of oxygen molecules inside a refrigerator of volume 22.0 ft.° is 465 m/s, what is the 


partial pressure of the oxygen? There are 5.71 moles of oxygen in the refrigerator, and the molar 
mass of oxygen is 32.0 g/mol. 


Solution: 


21.1 kPa 
Exercise: 
Problem: 
The escape velocity of any object from Earth is 11.1 km/s. At what temperature would oxygen 


molecules (molar mass is equal to 32.0 g/mol) have root-mean-square velocity Urms equal to Earth’s 
escape velocity of 11.1 km/s? 


Exercise: 
Problem: 
The escape velocity from the Moon is much smaller than that from the Earth, only 2.38 km/s. At 


what temperature would hydrogen molecules (molar mass is equal to 2.016 g/mol) have a root-mean- 
square velocity ¥;ms equal to the Moon’s escape velocity? 


Solution: 


458 K 
Exercise: 
Problem: 
Nuclear fusion, the energy source of the Sun, hydrogen bombs, and fusion reactors, occurs much 
more readily when the average kinetic energy of the atoms is high—that is, at high temperatures. 


Suppose you want the atoms in your fusion experiment to have average kinetic energies of 
6.40 x 10 -'4 J. What temperature is needed? 


Exercise: 


Problem: 


Suppose that the typical speed (v;ms) of carbon dioxide molecules (molar mass is 44.0 g/mol) in a 
flame is found to be 1350 m/s. What temperature does this indicate? 


Solution: 


3.22 x 10°K 
Exercise: 


Problem: 


(a) Hydrogen molecules (molar mass is equal to 2.016 g/mol) have v;ms equal to 193 m/s. What is 
the temperature? (b) Much of the gas near the Sun is atomic hydrogen (H rather than H2). Its 
temperature would have to be 1.5 x 10’ K for the rms speed vm; to equal the escape velocity from 
the Sun. What is that velocity? 


Exercise: 


Problem: 


There are two important isotopes of uranium, 7°°U and 7°°U; these isotopes are nearly identical 
chemically but have different atomic masses. Only 7°°U is very useful in nuclear reactors. 
Separating the isotopes is called uranium enrichment (and is often in the news as of this writing, 
because of concerns that some countries are enriching uranium with the goal of making nuclear 
weapons.) One of the techniques for enrichment, gas diffusion, is based on the different molecular 
speeds of uranium hexafluoride gas, UF. (a) The molar masses of 7°°U and ?°°UF. are 349.0 
g/mol and 352.0 g/mol, respectively. What is the ratio of their typical speeds v,ms? (b) At what 
temperature would their typical speeds differ by 1.00 m/s? (c) Do your answers in this problem 
imply that this technique may be difficult? 


Solution: 


a. 1.004; b. 764 K; c. This temperature is equivalent to 915 °F, which is high but not impossible to 
achieve. Thus, this process is feasible. At this temperature, however, there may be other 
considerations that make the process difficult. (In general, uranium enrichment by gaseous diffusion 
is indeed difficult and requires many passes.) 


Exercise: 
Problem: 
The partial pressure of carbon dioxide in the lungs is about 470 Pa when the total pressure in the 


lungs is 1.0 atm. What percentage of the air molecules in the lungs is carbon dioxide? Compare your 
result to the percentage of carbon dioxide in the atmosphere, about 0.033%. 


Exercise: 
Problem: 
Dry air consists of approximately 78% nitrogen, 21% oxygen, and 1% argon by mole, with trace 
amounts of other gases. A tank of compressed dry air has a volume of 1.76 cubic feet at a gauge 


pressure of 2200 pounds per square inch and a temperature of 293 K. How much oxygen does it 
contain in moles? 


Solution: 


65 mol 

Exercise: 
Problem: 
(a) Using data from the previous problem, find the mass of nitrogen, oxygen, and argon in 1 mol of 
dry air. The molar mass of N2 is 28.0 g/mol, that of Og is 32.0 g/mol, and that of argon is 39.9 
g/mol. (b) Dry air is mixed with pentane (C5H12, molar mass 72.2 g/mol), an important constituent 


of gasoline, in an air-fuel ratio of 15:1 by mass (roughly typical for car engines). Find the partial 
pressure of pentane in this mixture at an overall pressure of 1.00 atm. 


Exercise: 
Problem: 
(a) Given that air is 21% oxygen, find the minimum atmospheric pressure that gives a relatively safe 
partial pressure of oxygen of 0.16 atm. (b) What is the minimum pressure that gives a partial 
pressure of oxygen above the quickly fatal level of 0.06 atm? (c) The air pressure at the summit of 


Mount Everest (8848 m) is 0.334 atm. Why have a few people climbed it without oxygen, while 
some who have tried, even though they had trained at high elevation, had to turn back? 


Solution: 


a. 0.76 atm; b. 0.29 atm; c. The pressure there is barely above the quickly fatal level. 
Exercise: 

Problem: 

(a) If the partial pressure of water vapor is 8.05 torr, what is the dew point? 


(760 torr = 1 atm = 101,325 Pa) (b) Ona warm day when the air temperature is 35 °C and the 
dew point is 25 °C, what are the partial pressure of the water in the air and the relative humidity? 


Glossary 


Dalton’s law of partial pressures 
physical law that states that the total pressure of a gas is the sum of partial pressures of the 
component gases 


internal energy 
sum of the mechanical energies of all of the molecules in it 


kinetic theory of gases 
theory that derives the macroscopic properties of gases from the motion of the molecules they 
consist of 


mean free path 
average distance between collisions of a particle 


mean free time 
average time between collisions of a particle 


partial pressure 
pressure a gas would create if it occupied the total volume of space available 


root-mean-square (rms) speed 
square root of the average of the square (of a quantity) 


vapor pressure 
partial pressure of a vapor at which it is in equilibrium with the liquid (or solid, in the case of 
sublimation) phase of the same substance 


Heat Capacity and Equipartition of Energy 
By the end of this section, you will be able to: 


¢ Solve problems involving heat transfer to and from ideal monatomic gases 
whose volumes are held constant 

¢ Solve similar problems for non-monatomic ideal gases based on the number of 
degrees of freedom of a molecule 

e Estimate the heat capacities of metals using a model based on degrees of 
freedom 


In the chapter on temperature and heat, we defined the specific heat capacity with the 
equation Q = mcAT, or c = (1/m)Q/AT. However, the properties of an ideal gas 
depend directly on the number of moles in a sample, so here we define specific heat 
capacity in terms of the number of moles, not the mass. Furthermore, when talking 
about solids and liquids, we ignored any changes in volume and pressure with 
changes in temperature—a good approximation for solids and liquids, but for gases, 
we have to make some condition on volume or pressure changes. Here, we focus on 
the heat capacity with the volume held constant. We can calculate it for an ideal gas. 


Heat Capacity of an Ideal Monatomic Gas at Constant Volume 


We define the molar heat capacity at constant volume C'y as 
Equation: 


oe! re 
Cy = 7 Ap? with V held constant. 


This is often expressed in the form 


Note: 
Equation: 


Q = nCyAT. 


If the volume does not change, there is no overall displacement, so no work is done, 
and the only change in internal energy is due to the heat flow AFin, = Q. (This 
statement is discussed further in the next chapter.) We use the equation 


Ein, = 3nRT/2 to write AFin, = 3nRAT'/2 and substitute AF for Q to find 
Q = 3nRAT/2, which gives the following simple result for an ideal monatomic 
gas: 

Equation: 


3 
Cy = rie 


It is independent of temperature, which justifies our use of finite differences instead 
of a derivative. This formula agrees well with experimental results. 


In the next chapter we discuss the molar specific heat at constant pressure C,, which 
is always greater than Cy. 


Example: 

Calculating Temperature 

A sample of 0.125 kg of xenon is contained in a rigid metal cylinder, big enough that 
the xenon can be modeled as an ideal gas, at a temperature of 20.0 °C. The cylinder 
is moved outside on a hot summer day. As the xenon comes into equilibrium by 
reaching the temperature of its surroundings, 180 J of heat are conducted to it 
through the cylinder walls. What is the equilibrium temperature? Ignore the 
expansion of the metal cylinder. 

Solution 


1. Identify the knowns: We know the initial temperature T; is 20.0 °C, the heat Q 
is 180 J, and the mass m of the xenon is 0.125 kg. 

2. Identify the unknown. We need the final temperature, so we’ll need AT. 

3. Determine which equations are needed. Because xenon gas is monatomic, we 
can use Q = 3nRAT/2. Then we need the number of moles, n = m/M. 

4. Substitute the known values into the equations and solve for the unknowns. 
The molar mass of xenon is 131.3 g, so we obtain 
Equation: 


125¢g 
i S= 
131.3 g/mol 


= 0.952 mol, 


Equation: 


Ri occ 
3nR _3(0.952 mol)(8.31 J/mol - °C) 


Therefore, the final temperature is 35.2 “C. The problem could equally well be 
solved in kelvin; as a kelvin is the same size as a degree Celsius of temperature 
change, you would get AT’ = 15.2 K. 


Significance 
The heating of an ideal or almost ideal gas at constant volume is important in car 
engines and many other practical systems. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Suppose 2 moles of helium gas at 200 K are 
mixed with 2 moles of krypton gas at 400 K in a calorimeter. What is the final 
temperature? 


Solution: 


As the number of moles is equal and we know the molar heat capacities of the 
two gases are equal, the temperature is halfway between the initial 
temperatures, 300 K. 


We would like to generalize our results to ideal gases with more than one atom per 
molecule. In such systems, the molecules can have other forms of energy beside 
translational kinetic energy, such as rotational kinetic energy and vibrational kinetic 
and potential energies. We will see that a simple rule lets us determine the average 
energies present in these forms and solve problems in much the same way as we have 
for monatomic gases. 


Degrees of Freedom 


In the previous section, we found that smv? = SkpT and v2 = 3v2, from which it 
follows that =mv2 = +kpT. The same equation holds for v2 and for v2. Thus, we 
can look at our energy of 3kpT as the sum of contributions of $kpT from each of 
the three dimensions of translational motion. Shifting to the gas as a whole, we see 
that the 3 in the formula Cy = 2R also reflects those three dimensions. We define a 
degree of freedom as an independent possible motion of a molecule, such as each of 
the three dimensions of translation. Then, letting d represent the number of degrees of 


freedom, the molar heat capacity at constant volume of a monatomic ideal gas is 
Cyr= <R, where d = 3. 


The branch of physics called statistical mechanics tells us, and experiment confirms, 
that C'y of any ideal gas is given by this equation, regardless of the number of 
degrees of freedom. This fact follows from a more general result, the equipartition 
theorem, which holds in classical (non-quantum) thermodynamics for systems in 
thermal equilibrium under technical conditions that are beyond our scope. Here, we 
mention only that in a system, the energy is shared among the degrees of freedom by 
collisions. 


Note: 

Equipartition Theorem 

The energy of a thermodynamic system in equilibrium is partitioned equally among 
its degrees of freedom. Accordingly, the molar heat capacity of an ideal gas is 
proportional to its number of degrees of freedom, d: 

Equation: 


Cy = a 


This result is due to the Scottish physicist James Clerk Maxwell (1831-1871), whose 
name will appear several more times in this book. 


For example, consider a diatomic ideal gas (a good model for nitrogen, N2, and 
oxygen, Oz). Such a gas has more degrees of freedom than a monatomic gas. In 
addition to the three degrees of freedom for translation, it has two degrees of freedom 
for rotation perpendicular to its axis. Furthermore, the molecule can vibrate along its 
axis. This motion is often modeled by imagining a spring connecting the two atoms, 
and we know from simple harmonic motion that such motion has both kinetic and 


potential energy. Each of these forms of energy corresponds to a degree of freedom, 
giving two more. 


We might expect that for a diatomic gas, we should use 7 as the number of degrees of 
freedom; classically, if the molecules of a gas had only translational kinetic energy, 
collisions between molecules would soon make them rotate and vibrate. However, as 
explained in the previous module, quantum mechanics controls which degrees of 
freedom are active. The result is shown in [link]. Both rotational and vibrational 
energies are limited to discrete values. For temperatures below about 60 K, the 
energies of hydrogen molecules are too low for a collision to bring the rotational state 
or vibrational state of a molecule from the lowest energy to the second lowest, so the 
only form of energy is translational kinetic energy, and d = 3 or Cy = 3R/2asina 
monatomic gas. Above that temperature, the two rotational degrees of freedom begin 
to contribute, that is, some molecules are excited to the rotational state with the 
second-lowest energy. (This temperature is much lower than that where rotations of 
monatomic gases contribute, because diatomic molecules have much higher rotational 
inertias and hence much lower rotational energies.) From about room temperature (a 
bit less than 300 K) to about 600 K, the rotational degrees of freedom are fully active, 
but the vibrational ones are not, and d = 5. Then, finally, above about 3000 K, the 
vibrational degrees of freedom are fully active, and d = 7 as the classical theory 
predicted. 
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The molar heat capacity of hydrogen as a function of temperature (on a 
logarithmic scale). The three “steps” or “plateaus” show different numbers of 
degrees of freedom that the typical energies of molecules must achieve to 
activate. Translational kinetic energy corresponds to three degrees of freedom, 
rotational to another two, and vibrational to yet another two. 


Polyatomic molecules typically have one additional rotational degree of freedom at 
room temperature, since they have comparable moments of inertia around any axis. 
Thus, at room temperature, they have d = 6, and at high temperature, d = 8. We 
usually assume that gases have the theoretical room-temperature values of d. 


As shown in [link], the results agree well with experiments for many monatomic and 
diatomic gases, but the agreement for triatomic gases is only fair. The differences 
arise from interactions that we have ignored between and within molecules. 


Gas Cy /R at 25 °C and 1 atm 
Ar 1.50 

He 1.50 

Ne 1.50 

CO 2.50 

Hp» 2.47 

No 2.50 

O2 2.03 

F, 2.8 


CO, 3.48 


Gas 
H2S 
N2O 


Cy /R at 25 °C and 1 atm 


3.13 


3.66 


Cy / R for Various Monatomic, Diatomic, and Triatomic Gases 


What about internal energy for diatomic and polyatomic gases? For such gases, C'y is 
a function of temperature ([link]), so we do not have the kind of simple result we 


have for monatomic ideal gases. 


Molar Heat Capacity of Solid Elements 


The idea of equipartition leads to an estimate of the molar heat capacity of solid 
elements at ordinary temperatures. We can model the atoms of a solid as attached to 


neighboring atoms by springs ([link]). 
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In a simple model of a solid element, each atom is 


attached to others by six springs, two tor each possible 
motion: x, y, and z. Each of the three motions 
corresponds to two degrees of freedom, one for kinetic 
energy and one for potential energy. Thus d = 6. 


Analogously to the discussion of vibration in the previous module, each atom has six 
degrees of freedom: one kinetic and one potential for each of the x-, y-, and z- 
directions. Accordingly, the molar specific heat of a metal should be 3R. This result, 
known as the Law of Dulong and Petit, works fairly well experimentally at room 
temperature. (For every element, it fails at low temperatures for quantum-mechanical 
reasons. Since quantum effects are particularly important for low-mass particles, the 
Law of Dulong and Petit already fails at room temperature for some light elements, 
such as beryllium and carbon. It also fails for some heavier elements for various 
reasons beyond what we can cover.) 


Note: 

Problem-Solving Strategy: Heat Capacity and Equipartition 

The strategy for solving these problems is the same as the one in Phase Changes for 
the effects of heat transfer. The only new feature is that you should determine 
whether the case just presented—ideal gases at constant volume—applies to the 
problem. (For solid elements, looking up the specific heat capacity is generally better 
than estimating it from the Law of Dulong and Petit.) In the case of an ideal gas, 
determine the number d of degrees of freedom from the number of atoms in the gas 
molecule and use it to calculate Cy (or use C’y to solve for d). 


Example: 

Calculating Temperature: Calorimetry with an Ideal Gas 

A 300-g piece of solid gallium (a metal used in semiconductor devices) at its melting 
point of only 30.0 °C is in contact with 12.0 moles of air (assumed diatomic) at 
95.0 °C in an insulated container. When the air reaches equilibrium with the 
gallium, 202 g of the gallium have melted. Based on those data, what is the heat of 
fusion of gallium? Assume the volume of the air does not change and there are no 
other heat transfers. 

Strategy 

We’ll use the equation Qhot + Qeoia = 0. As some of the gallium doesn’t melt, we 
know the final temperature is still the melting point. Then the only Qyot is the heat 


lost as the air cools, Qnot = NairCy AT, where Cy = 5R/2. The only Qeoig is the 
latent heat of fusion of the gallium, Qcolad = McaLz¢. It is positive because heat 
flows into the gallium. 

Solution 


1. Set up the equation: 
Equation: 


NarCy AT + mea, Lt = 0. 


2. Substitute the known values and solve: 
Equation: 


(12.0 mol) 2) yee (30.0 °C — 95.0 °C) + (0.202 kg) Le = 0. 
2 mol: °C 


We solve to find that the heat of fusion of gallium is 80.2 kJ/kg. 


Summary 


e Every degree of freedom of an ideal gas contributes skpT per atom or 
molecule to its changes in internal energy. 

e Every degree of freedom contributes $R to its molar heat capacity at constant 
volume Cy. 

¢ Degrees of freedom do not contribute if the temperature is too low to excite the 
minimum energy of the degree of freedom as given by quantum mechanics. 
Therefore, at ordinary temperatures, d = 3 for monatomic gases, d = 5 for 
diatomic gases, and d ~ 6 for polyatomic gases. 


Conceptual Questions 


Exercise: 


Problem: 


Experimentally it appears that many polyatomic molecules’ vibrational degrees 
of freedom can contribute to some extent to their energy at room temperature. 
Would you expect that fact to increase or decrease their heat capacity from the 
value R? Explain. 


Exercise: 
Problem: 
One might think that the internal energy of diatomic gases is given by 
Ein, = 5RT/2. Do diatomic gases near room temperature have more or less 
internal energy than that? Hint: Their internal energy includes the total energy 


added in raising the temperature from the boiling point (very low) to room 
temperature. 


Solution: 


Less, because at lower temperatures their heat capacity was only 3RT/2. 
Exercise: 
Problem: 


You mix 5 moles of Hy» at 300 K with 5 moles of He at 360 K in a perfectly 
insulated calorimeter. Is the final temperature higher or lower than 330 K? 


Problems 


Exercise: 


Problem: 


To give a helium atom nonzero angular momentum requires about 21.2 eV of 
energy (that is, 21.2 eV is the difference between the energies of the lowest- 
energy or ground state and the lowest-energy state with angular momentum). 
The electron-volt or eV is defined as 1.60 x 10~!° J. Find the temperature T 
where this amount of energy equals kgT'/2. Does this explain why we can 
ignore the rotational energy of helium for most purposes? (The results for other 
monatomic gases, and for diatomic gases rotating around the axis connecting the 
two atoms, have comparable orders of magnitude.) 


Solution: 


4.92 x 10° K; Yes, that’s an impractically high temperature. 


Exercise: 


Problem: 


(a) How much heat must be added to raise the temperature of 1.5 mol of air from 
25.0 °C to 33.0 °C at constant volume? Assume air is completely diatomic. (b) 
Repeat the problem for the same number of moles of xenon, Xe. 


Exercise: 
Problem: 
A sealed, rigid container of 0.560 mol of an unknown ideal gas at a temperature 


of 30.0 °C is cooled to —40.0 °C. In the process, 980 J of heat are removed 
from the gas. Is the gas monatomic, diatomic, or polyatomic? 


Solution: 


polyatomic 

Exercise: 
Problem: 
A sample of neon gas (Ne, molar mass M = 20.2 g/mol) at a temperature of 
13.0 °C is put into a steel container of mass 47.2 g that’s at a temperature of 
—40.0 °C. The final temperature is—28.0 °C. (No heat is exchanged with the 


surroundings, and you can neglect any change in the volume of the container.) 
What is the mass of the sample of neon? 


Exercise: 
Problem: 
A steel container of mass 135 g contains 24.0 g of ammonia, NH3, which has a 
molar mass of 17.0 g/mol. The container and gas are in equilibrium at 12.0 °C. 


How much heat has to be removed to reach a temperature of —20.0 °C ? Ignore 
the change in volume of the steel. 


Solution: 


3.08 x 10° J 


Exercise: 


Problem: 


A sealed room has a volume of 24 m°. It’s filled with air, which may be 
assumed to be diatomic, at a temperature of 24 °C and a pressure of 

9.83 x 104 Pa. A 1.00-kg block of ice at its melting point is placed in the 
room. Assume the walls of the room transfer no heat. What is the equilibrium 
temperature? 


Exercise: 


Problem: 


Heliox, a mixture of helium and oxygen, is sometimes given to hospital patients 
who have trouble breathing, because the low mass of helium makes it easier to 
breathe than air. Suppose helium at 25 °C is mixed with oxygen at 35 °C to 
make a mixture that is 70 % helium by mole. What is the final temperature? 
Ignore any heat flow to or from the surroundings, and assume the final volume is 
the sum of the initial volumes. 


Solution: 


29.2 °C 
Exercise: 


Problem: 


Professional divers sometimes use heliox, consisting of 79 % helium and 21% 
oxygen by mole. Suppose a perfectly rigid scuba tank with a volume of 11 L 
contains heliox at an absolute pressure of 2.1 x 10’ Pa at a temperature of 

31 °C. (a) How many moles of helium and how many moles of oxygen are in 
the tank? (b) The diver goes down to a point where the sea temperature is 27 “C 
while using a negligible amount of the mixture. As the gas in the tank reaches 
this new temperature, how much heat is removed from it? 


Exercise: 


Problem: 


In car racing, one advantage of mixing liquid nitrous oxide (N2O) with air is 
that the boiling of the “nitrous” absorbs latent heat of vaporization and thus 
cools the air and ultimately the fuel-air mixture, allowing more fuel-air mixture 
to go into each cylinder. As a very rough look at this process, suppose 1.0 mol of 
nitrous oxide gas at its boiling point, —88 °C, is mixed with 4.0 mol of air 
(assumed diatomic) at 30 “C. What is the final temperature of the mixture? Use 
the measured heat capacity of N2O at 25 °C, which is 30.4 J/mol °C. (The 
primary advantage of nitrous oxide is that it consists of 1/3 oxygen, which is 
more than air contains, so it supplies more oxygen to burn the fuel. Another 
advantage is that its decomposition into nitrogen and oxygen releases energy in 
the cylinder.) 


Solution: 


—1.6°C 


Glossary 


degree of freedom 
independent kind of motion possessing energy, such as the kinetic energy of 
motion in one of the three orthogonal spatial directions 


equipartition theorem 
theorem that the energy of a classical thermodynamic system is shared equally 
among its degrees of freedom 


Distribution of Molecular Speeds 
By the end of this section, you will be able to: 


¢ Describe the distribution of molecular speeds in an ideal gas 
e Find the average and most probable molecular speeds in an ideal gas 


Particles in an ideal gas all travel at relatively high speeds, but they do not travel at the same speed. 
The rms speed is one kind of average, but many particles move faster and many move slower. The 
actual distribution of speeds has several interesting implications for other areas of physics, as we will 
see in later chapters. 


The Maxwell-Boltzmann Distribution 


The motion of molecules in a gas is random in magnitude and direction for individual molecules, but 
a gas of many molecules has a predictable distribution of molecular speeds. This predictable 
distribution of molecular speeds is known as the Maxwell-Boltzmann distribution, after its 


originators, who calculated it based on kinetic theory, and it has since been confirmed experimentally 
({link]). 


To understand this figure, we must define a distribution function of molecular speeds, since with a 
finite number of molecules, the probability that a molecule will have exactly a given speed is 0. 


Probability 
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The Maxwell-Boltzmann distribution of molecular speeds in an ideal 
gas. The most likely speed vy is less than the rms speed Uyms. 
Although very high speeds are possible, only a tiny fraction of the 
molecules have speeds that are an order of magnitude greater than 


Urms : 


We define the distribution function f(v) by saying that the expected number N(v1, v2) of particles 
with speeds between v1 and v9 is given by 
Equation: 


N(v1, 02) = nf f(v)dv. 


[Since N is dimensionless, the unit of f(v) is seconds per meter.] We can write this equation 
conveniently in differential form: 
Equation: 


dN = Nf(v)dv. 


In this form, we can understand the equation as saying that the number of molecules with speeds 
between v and v + dv is the total number of molecules in the sample times f(v) times dv. That is, the 
probability that a molecule’s speed is between v and v + dv is f(v)dv. 


We can now quote Maxwell’s result, although the proof is beyond our scope. 


Note: 

Maxwell-Boltzmann Distribution of Speeds 

The distribution function for speeds of particles in an ideal gas at temperature T is 
Equation: 


3/2 
flv) = zai yeh ) Doemruen cist, 
JT 2kpT 


The factors before the v? are a normalization constant; they make sure that N(0, 00) = N by making 
sure that / f(v)dv = 1. Let’s focus on the dependence on v. The factor of v? means that f(0) = 0 
0 


and for small v, the curve looks like a parabola. The factor of e~””/248T means that lim f (v) =0 
U->0Co 


and the graph has an exponential tail, which indicates that a few molecules may move at several times 
the rms speed. The interaction of these factors gives the function the single-peaked shape shown in 
the figure. 


Example: 

Calculating the Ratio of Numbers of Molecules Near Given Speeds 

In a sample of nitrogen (Ng, with a molar mass of 28.0 g/mol) at a temperature of 273 °C, find the 
ratio of the number of molecules with a speed very close to 300 m/s to the number with a speed very 


close to 100 m/s. 

Strategy 

Since we’re looking at a small range, we can approximate the number of molecules near 100 m/s as 
dNi09 = f(100 m/s)dv. Then the ratio we want is 

Equation: 


dN39 _ f(300m/s)dv _ f(300m/s) 


dNio)  f(100m/s)dv —f(100 m/s) * 


All we have to do is take the ratio of the two f values. 
Solution 


1. Identify the knowns and convert to SI units if necessary. 
Equation: 


T= S00K he — ate Wek 


Equation: 
M = 0.0280 kg/molsom = 4.65 x 10°*° kg 


2. Substitute the values and solve. 
Equation: 


#(300 m/s) its * (300 m/s)? exp[—m(300 m/s)?/2ke TI 
HOO at/e) <2 x5 ie (100 m/s)? exp[—m(100 m/s)?/2kpT] 

(300 m/s)” exp[—(4.65 x 10~?° kg) (300 m/s) /2(1.38 x 10-23 J/K)(300 K)| 
(100 m/s)” exp[—(4.65 x 10-6 kg)(100 m/s)” /2(1.38 x 10-23 J/K)(300 K)] 


6 (4.65 x 10-6 kg)[(300 m/s)?—(100 ms)*| 
= 3° exp |- 2(1.38 x 10 = J/K) (300 K) 


= 5.74 


[link] shows that the curve is shifted to higher speeds at higher temperatures, with a broader range of 
speeds. 


Probability 


Speed v (m/s) 


The Maxwell-Boltzmann distribution is 
shifted to higher speeds and broadened at 
higher temperatures. 


Note: 

With only a relatively small number of molecules, the distribution of speeds fluctuates around the 
Maxwell-Boltzmann distribution. However, you can view this simulation to see the essential features 
that more massive molecules move slower and have a narrower distribution. Use the set-up “2 Gases, 
Random Speeds”. Note the display at the bottom comparing histograms of the speed distributions 
with the theoretical curves. 


We can use a probability distribution to calculate average values by multiplying the distribution 
function by the quantity to be averaged and integrating the product over all possible speeds. (This is 
analogous to calculating averages of discrete distributions, where you multiply each value by the 
number of times it occurs, add the results, and divide by the number of values. The integral is 
analogous to the first two steps, and the normalization is analogous to dividing by the number of 
values.) Thus the average velocity is 


Note: 
Equation: 


Pas 7 SE eee 
s= f uflo)dv = Say at = arr 


Similarly, 
Equation: 


Up, is the speed at the peak of the velocity distribution. (In statistics it would be called the mode.) It is 
less than the rms speedv;ms. The most probable speed can be calculated by the more familiar method 
of setting the derivative of the distribution function, with respect to v, equal to 0. The result is 


Note: 
Equation: 


which is less than v;ms. In fact, the rms speed is greater than both the most probable speed and the 
average speed. 


The peak speed provides a sometimes more convenient way to write the Maxwell-Boltzmann 
distribution function: 
Equation: 
2 
Au —y2 [v2 


——e 
V3 


f(v) = 


—my2 Sh : . : A : 
In the factor e~”””/?*8T’ it is easy to recognize the translational kinetic energy. Thus, that expression 


is equal to e~*/*sT, The distribution f(v) can be transformed into a kinetic energy distribution by 
requiring that f(K)dk = f(v)dv. Boltzmann showed that the resulting formula is much more 
generally applicable if we replace the kinetic energy of translation with the total mechanical energy E. 
Boltzmann’s result is 

Equation: 


2) ee eee ee 2 VE 
f(#) = ve (kpT’) V Ee = ValkpT) 2” eE/ksT * 


The first part of this equation, with the negative exponential, is the usual way to write it. We give the 
second part only to remark that e” /ksT in the denominator is ubiquitous in quantum as well as 
classical statistical mechanics. 


Note: 

Problem-Solving Strategy: Speed Distribution 

Step 1. Examine the situation to determine that it relates to the distribution of molecular speeds. 
Step 2. Make a list of what quantities are given or can be inferred from the problem as stated 
(identify the known quantities). 

Step 3. Identify exactly what needs to be determined in the problem (identify the unknown 
quantities). A written list is useful. 

Step 4. Convert known values into proper SI units (K for temperature, Pa for pressure, m® for 
volume, molecules for N, and moles for n). In many cases, though, using R and the molar mass will 
be more convenient than using kg and the molecular mass. 

Step 5. Determine whether you need the distribution function for velocity or the one for energy, and 
whether you are using a formula for one of the characteristic speeds (average, most probably, or 
rms), finding a ratio of values of the distribution function, or approximating an integral. 

Step 6. Solve the appropriate equation for the ideal gas law for the quantity to be determined (the 
unknown quantity). Note that if you are taking a ratio of values of the distribution function, the 
normalization factors divide out. Or if approximating an integral, use the method asked for in the 
problem. 

Step 7. Substitute the known quantities, along with their units, into the appropriate equation and 
obtain numerical solutions complete with units. 


We can now gain a qualitative understanding of a puzzle about the composition of Earth’s 
atmosphere. Hydrogen is by far the most common element in the universe, and helium is by far the 
second-most common. Moreover, helium is constantly produced on Earth by radioactive decay. Why 
are those elements so rare in our atmosphere? The answer is that gas molecules that reach speeds 
above Earth’s escape velocity, about 11 km/s, can escape from the atmosphere into space. Because of 
the lower mass of hydrogen and helium molecules, they move at higher speeds than other gas 
molecules, such as nitrogen and oxygen. Only a few exceed escape velocity, but far fewer heavier 
molecules do. Thus, over the billions of years that Earth has existed, far more hydrogen and helium 
molecules have escaped from the atmosphere than other molecules, and hardly any of either is now 
present. 


We can also now take another look at evaporative cooling, which we discussed in the chapter on 
temperature and heat. Liquids, like gases, have a distribution of molecular energies. The highest- 
energy molecules are those that can escape from the intermolecular attractions of the liquid. Thus, 
when some liquid evaporates, the molecules left behind have a lower average energy, and the liquid 
has a lower temperature. 


Summary 


¢ The motion of individual molecules in a gas is random in magnitude and direction. However, a 
gas of many molecules has a predictable distribution of molecular speeds, known as the 
Maxwell-Boltzmann distribution. 

e The average and most probable velocities of molecules having the Maxwell-Boltzmann speed 
distribution, as well as the rms velocity, can be calculated from the temperature and molecular 
mass. 


Key Equations 


Ideal gas law in terms of molecules 
Ideal gas law ratios if the amount of gas is constant 


Ideal gas law in terms of moles 


Van der Waals equation 


Pressure, volume, and molecular speed 
Root-mean-square speed 


Mean free path 


Mean free time 


The following two equations apply only to a 
monatomic ideal gas: 


Average kinetic energy of a molecule 


Internal energy 


Heat in terms of molar heat capacity at constant 
volume 


Molar heat capacity at constant volume for an ideal 
gas with d degrees of freedom 


Maxwell—Boltzmann speed distribution 


Average velocity of a molecule 


Peak velocity of a molecule 


Conceptual Questions 


Exercise: 
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Problem: 


One cylinder contains helium gas and another contains krypton gas at the same temperature. 
Mark each of these statements true, false, or impossible to determine from the given information. 
(a) The rms speeds of atoms in the two gases are the same. (b) The average kinetic energies of 
atoms in the two gases are the same. (c) The internal energies of 1 mole of gas in each cylinder 
are the same. (d) The pressures in the two cylinders are the same. 


Solution: 


a. false; b. true; c. true; d. true 
Exercise: 


Problem: 


Repeat the previous question if one gas is still helium but the other is changed to fluorine, F2. 
Exercise: 
Problem: 


An ideal gas is at a temperature of 300 K. To double the average speed of its molecules, what 
does the temperature need to be changed to? 


Solution: 


1200 K 


Problems 


Exercise: 
Problem: 
In a sample of hydrogen sulfide (IZ = 34.1 g/mol) at a temperature of 3.00 x 10? Kk, 


estimate the ratio of the number of molecules that have speeds very close to v;ms to the number 
that have speeds very close to 20yms- 


Exercise: 


Problem: 


vi +Av 
Using the approximation / f(v)dv = f(v1)Av for small Av, estimate the fraction of 


v1 
nitrogen molecules at a temperature of 3.00 x 107 K that have speeds between 290 m/s and 
291 m/s. 


Solution: 


0.00157 


Exercise: 


Problem: 


Using the method of the preceding problem, estimate the fraction of nitric oxide (NO) molecules 
at a temperature of 250 K that have energies between 3.45 x 10-7! J and3.50 x 10°77! J. 


Exercise: 


Problem: 


By counting squares in the following figure, estimate the fraction of argon atoms at T = 300 K 
that have speeds between 600 m/s and 800 m/s. The curve is correctly normalized. The value of 
a square is its length as measured on the x-axis times its height as measured on the y-axis, with 
the units given on those axes. 
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Solution: 


About 0.072. Answers may vary slightly. A more accurate answer is 0.074. 
Exercise: 
Problem: 
Using a numerical integration method such as Simpson’s rule, find the fraction of molecules in a 


sample of oxygen gas at a temperature of 250 K that have speeds between 100 m/s and 150 m/s. 
The molar mass of oxygen (Og) is 32.0 g/mol. A precision to two significant digits is enough. 


Exercise: 


Problem: 


Find (a) the most probable speed, (b) the average speed, and (c) the rms speed for nitrogen 
molecules at 295 K. 


Solution: 


a. 419 m/s; b. 472 m/s; c. 513 m/s 


Exercise: 


Problem: Repeat the preceding problem for nitrogen molecules at 2950 K. 
Exercise: 
Problem: 


At what temperature is the average speed of carbon dioxide molecules (M = 44.0 g/mol) 510 
m/s? 


Solution: 


541 K 
Exercise: 
Problem: 
The most probable speed for molecules of a gas at 296 K is 263 m/s. What is the molar mass of 
the gas? (You might like to figure out what the gas is likely to be.) 
Exercise: 
Problem: 
a) At what temperature do oxygen molecules have the same average speed as helium atoms 


(M = 4.00 g/mol) have at 300 K? b) What is the answer to the same question about most 
probable speeds? c) What is the answer to the same question about rms speeds? 


Solution: 


2400 K for all three parts 


Additional Problems 


Exercise: 
Problem: 
In the deep space between galaxies, the density of molecules (which are mostly single atoms) 
can be as low as 10° atoms / m’*, and the temperature is a frigid 2.7 K. What is the pressure? (b) 


What volume (in m?) is occupied by 1 mol of gas? (c) If this volume is a cube, what is the 
length of its sides in kilometers? 


Exercise: 
Problem: 
(a) Find the density in SI units of air at a pressure of 1.00 atm and a temperature of 20 °C, 
assuming that air is 78% N2, 21% O2, and 1% Ar, (b) Find the density of the atmosphere on 


Venus, assuming that it’s 96% CO, and 4% No, with a temperature of 737 K and a pressure of 
92.0 atm. 


Solution: 


a. 1.20kg/m®; b. 65.9kg/m® 
Exercise: 
Problem: 
The air inside a hot-air balloon has a temperature of 370 K and a pressure of 101.3 kPa, the same 


as that of the air outside. Using the composition of air as 78% N2,21%O2, and 1% Ar, find the 
density of the air inside the balloon. 


Exercise: 
Problem: 
When an air bubble rises from the bottom to the top of a freshwater lake, its volume increases by 


80 %. If the temperatures at the bottom and the top of the lake are 4.0 and 10 °C, respectively, 
how deep is the lake? 


Solution: 


7.9m 

Exercise: 
Problem: 
(a) Use the ideal gas equation to estimate the temperature at which 1.00 kg of steam (molar mass 
M = 18.0 g/mol) at a pressure of 1.50 x 10° Pa occupies a volume of 0.220 m?. (b) The van 
der Waals constants for water are a = 0.5537 Pa- m®/mol” and b = 3.049 x 10°-° m?/mol. 


Use the Van der Waals equation of state to estimate the temperature under the same conditions. 
(c) The actual temperature is 779 K. Which estimate is better? 


Exercise: 


Problem: 


One process for decaffeinating coffee uses carbon dioxide (M = 44.0 g/mol) at a molar 
density of about 14,600 mol/ m’ anda temperature of about 60 °C. (a) Is CO2 a solid, liquid, 
gas, or supercritical fluid under those conditions? (b) The van der Waals constants for carbon 
dioxide are a = 0.3658 Pa - m°/mol? and b = 4.286 x 10~° m?/mol. Using the van der 
Waals equation, estimate the pressure of COg at that temperature and density. 


Solution: 


a. supercritical fluid; b. 3.00 x 10’ Pa 
Exercise: 
Problem: 
On a winter day when the air temperature is 0 °C, the relative humidity is 50 %. Outside air 


comes inside and is heated to a room temperature of 20 °C. What is the relative humidity of the 
air inside the room. (Does this problem show why inside air is so dry in winter?) 


Exercise: 


Problem: 


On a warm day when the air temperature is 30 °C, a metal can is slowly cooled by adding bits of 
ice to liquid water in it. Condensation first appears when the can reaches 15 °C. What is the 
relative humidity of the air? 


Solution: 


40.18% 
Exercise: 


Problem: 


(a) People often think of humid air as “heavy.” Compare the densities of air with 0 % relative 
humidity and 100 % relative humidity when both are at 1 atm and 30 °C. Assume that the dry air 
is an ideal gas composed of molecules with a molar mass of 29.0 g/mol and the moist air is the 
same gas mixed with water vapor. (b) As discussed in the chapter on the applications of 
Newton’s laws, the air resistance felt by projectiles such as baseballs and golf balls is 
approximately Fp = CpAv?/2, where p is the mass density of the air, A is the cross-sectional 
area of the projectile, and C is the projectile’s drag coefficient. For a fixed air pressure, describe 
qualitatively how the range of a projectile changes with the relative humidity. (c) When a 
thunderstorm is coming, usually the humidity is high and the air pressure is low. Do those 
conditions give an advantage or disadvantage to home-run hitters? 


Exercise: 
Problem: 
The mean free path for helium at a certain temperature and pressure is 2.10 x 10~’ m. The 
radius of a helium atom can be taken as 1.10 x 10!’ m. What is the measure of the density of 


helium under those conditions (a) in molecules per cubic meter and (b) in moles per cubic 
meter? 


Solution: 


asd le 0°" molecules /m’; b. 3.67 x 103 mol /m* 
Exercise: 
Problem: 
The mean free path for methane at a temperature of 269 K and a pressure of 1.11 x 10° Pais 
4.81 x 10° m. Find the effective radius r of the methane molecule. 


Exercise: 


Problem: 


In the chapter on fluid mechanics, Bernoulli’s equation for the flow of incompressible fluids was 
explained in terms of changes affecting a small volume dV of fluid. Such volumes are a 
fundamental idea in the study of the flow of compressible fluids such as gases as well. For the 
equations of hydrodynamics to apply, the mean free path must be much less than the linear size 
of such a volume, a + dV”. For air in the stratosphere at a temperature of 220 K anda 
pressure of 5.8 kPa, how big should a be for it to be 100 times the mean free path? Take the 
effective radius of air molecules to be 1.88 x 10~' m, which is roughly correct for Ny». 


Solution: 


8.2 mm 

Exercise: 
Problem: 
Find the total number of collisions between molecules in 1.00 s in 1.00 L of nitrogen gas at 
standard temperature and pressure (0 °C, 1.00 atm). Use 1.88 x 10~'° mas the effective 
radius of a nitrogen molecule. (The number of collisions per second is the reciprocal of the 


collision time.) Keep in mind that each collision involves two molecules, so if one molecule 
collides once in a certain period of time, the collision of the molecule it hit cannot be counted. 


Exercise: 


Problem: 


(a) Estimate the specific heat capacity of sodium from the Law of Dulong and Petit. The molar 
mass of sodium is 23.0 g/mol. (b) What is the percent error of your estimate from the known 
value, 1230 J/kg - °C? 


Solution: 


a. 1080 J/kg °C; b. 12% 
Exercise: 


Problem: 


A sealed, perfectly insulated container contains 0.630 mol of air at 20.0 °C and an iron stirring 
bar of mass 40.0 g. The stirring bar is magnetically driven to a kinetic energy of 50.0 J and 
allowed to slow down by air resistance. What is the equilibrium temperature? 


Exercise: 


Problem: 


Find the ratio f(vp)/f (rms) for hydrogen gas (IM = 2.02 g/mol) at a temperature of 77.0 K. 


Solution: 


2,/e/3 or about 1.10 


Exercise: 


Problem: 


Unreasonable results. (a) Find the temperature of 0.360 kg of water, modeled as an ideal gas, at 
a pressure of 1.01 x 10° Paif it has a volume of 0.615 m?. (b) What is unreasonable about 
this answer? How could you get a better answer? 


Exercise: 
Problem: 
Unreasonable results. (a) Find the average speed of hydrogen sulfide, H25, molecules at a 


temperature of 250 K. Its molar mass is 31.4 g/mol (b) The result isn’t very unreasonable, but 
why is it less reliable than those for, say, neon or nitrogen? 


Solution: 


a. 411 m/s; b. According to [link], the Cy of H2S is significantly different from the theoretical 
value, so the ideal gas model does not describe it very well at room temperature and pressure, 
and the Maxwell-Boltzmann speed distribution for ideal gases may not hold very well, even less 
well at a lower temperature. 


Challenge Problems 


Exercise: 


Problem: 


An airtight dispenser for drinking water is 25cm x 10cm in horizontal dimensions and 20 cm 
tall. It has a tap of negligible volume that opens at the level of the bottom of the dispenser. 
Initially, it contains water to a level 3.0 cm from the top and air at the ambient pressure, 1.00 
atm, from there to the top. When the tap is opened, water will flow out until the gauge pressure 
at the bottom of the dispenser, and thus at the opening of the tap, is 0. What volume of water 
flows out? Assume the temperature is constant, the dispenser is perfectly rigid, and the water has 
a constant density of 1000 kg/ m°. 


Exercise: 
Problem: 


Eight bumper cars, each with a mass of 322 kg, are running in a room 21.0 m long and 13.0 m 
wide. They have no drivers, so they just bounce around on their own. The rms speed of the cars 


average force per unit length (analogous to pressure) that the cars exert on the walls. 
Solution: 


29.5 N/m 


Exercise: 


Problem: Verify that v, = 4/ es 


Exercise: 
Problem: 
Verify the normalization equation / f(v)dv = 1. In doing the integral, first make the 
0 


substitution u = ,/s-7u = <.. This “scaling” transformation gives you all features of the 
B Up 


answer except for the integral, which is a dimensionless numerical factor. You’ll need the 
formula 


. xe dx = 
0 


to find the numerical factor and verify the normalization. 


Solution: 


Substituting v = 4/ aa. u and du = / wae du gives 


foe) 3/2 oO 3/2 
| (= ; vie mM /AksT dy =| (= pes we ahr du 
0 Vn \ 2kpT 0 Va \ 2kpgT m m 


ms Vr 4) 
Exercise: 
Problem: 
Verify that 0 = on ‘eT’ Make the same scaling transformation as in the preceding problem. 
Exercise: 


Problem: Verify that v:ms = ,E: = / Set 


Solution: 


Making the scaling transformation as in the previous problems, we find that 


: 00 3/2 00 
v= i ; ( = ) yevr2e Me /2kBT dy = ee ohet ute du. 
0 nm \ 2kpT 0 Vr m 


As in the previous problem, we integrate by parts: 


CO 5 CO 
- u'e” du= |- 7 Lipew le -+ 3 we du. 
0 0 0 


Again, the first term is 0, and we were given in an earlier problem that the integral in the second 


term equals “ V™ We now have 

2 _4 kyl 3 VT _ 3kpT 

—— = => . 
JT mM 2 #4 m 


Taking the square root of both sides gives the desired result: Yrms = / =, 


Glossary 


Maxwell-Boltzmann distribution 
function that can be integrated to give the probability of finding ideal gas molecules with speeds 
in the range between the limits of integration 


most probable speed 
speed near which the speeds of most molecules are found, the peak of the speed distribution 
function 


peak speed 
same as “most probable speed” 


Introduction 
class="introduction" 


A weak cold 
front of air 
pushes all the 
smog in 
northeastern 
China into a 
giant smog 
blanket over the 
Yellow Sea, as 
captured by 
NASA’s Terra 
satellite in 2012. 
To understand 
changes in 
weather and 
climate, such as 
the event shown 
here, you need a 
thorough 
knowledge of 
thermodynamics 
. (credit: 
modification of 
work by NASA) 


Heat is the transfer of energy due to a temperature difference between two 
systems. Heat describes the process of converting from one form of energy 
into another. A car engine, for example, burns gasoline. Heat is produced 
when the burned fuel is chemically transformed into mostly CO, and HO, 
which are gases at the combustion temperature. These gases exert a force on 
a piston through a displacement, doing work and converting the piston’s 
kinetic energy into a variety of other forms—into the car’s kinetic energy; 
into electrical energy to run the spark plugs, radio, and lights; and back into 
stored energy in the car’s battery. 


Energy is conserved in all processes, including those associated with 
thermodynamic systems. The roles of heat transfer and internal energy 
change vary from process to process and affect how work is done by the 
system in that process. We will see that the first law of thermodynamics 
explains that a change in the internal energy of a system comes from 
changes in heat or work. Understanding the laws that govern 
thermodynamic processes and the relationship between the system and its 
surroundings is therefore paramount in gaining scientific knowledge of 
energy and energy consumption. 


Thermodynamic Systems 
By the end of this section, you will be able to: 


¢ Define a thermodynamic system, its boundary, and its surroundings 
e Explain the roles of all the components involved in thermodynamics 
¢ Define thermal equilibrium and thermodynamic temperature 

e Link an equation of state to a system 


A thermodynamic system includes anything whose thermodynamic 
properties are of interest. It is embedded in its surroundings or 
environment; it can exchange heat with, and do work on, its environment 
through a boundary, which is the imagined wall that separates the system 
and the environment ([link]). In reality, the immediate surroundings of the 
system are interacting with it directly and therefore have a much stronger 
influence on its behavior and properties. For example, if we are studying a 
car engine, the burning gasoline inside the cylinder of the engine is the 
thermodynamic system; the piston, exhaust system, radiator, and air outside 
form the surroundings of the system. The boundary then consists of the 
inner surfaces of the cylinder and piston. 


Surroundings 


Piston Boundary 


Surroundings 
Surroundings 


Surroundings Boundary Surroundings 


(a) (b) 


(a) A system, which can include any relevant process or value, is self- 
contained in an area. The surroundings may also have relevant 
information; however, the surroundings are important to study only if 
the situation is an open system. (b) The burning gasoline in the 
cylinder of a car engine is an example of a thermodynamic system. 


Normally, a system must have some interactions with its surroundings. A 
system is called an isolated and closed system if it is completely separated 
from its environment—for example, a gas that is surrounded by immovable 
and thermally insulating walls. In reality, a closed system does not exist 
unless the entire universe is treated as the system, or it is used as a model 
for an actual system that has minimal interactions with its environment. 
Most systems are known as an open system, which can exchange energy 
and/or matter with its surroundings ([link]). 


(a) (b) 


(a) This boiling tea kettle is an open thermodynamic system. It 
transfers heat and matter (steam) to its surroundings. (b) A pressure 
cooker is a good approximation to a closed system. A little steam 
escapes through the top valve to prevent explosion. (credit a: 
modification of work by Gina Hamilton; credit b: modification of 
work by Jane Whitney) 


When we examine a thermodynamic system, we ignore the difference in 
behavior from place to place inside the system for a given moment. In other 
words, we concentrate on the macroscopic properties of the system, which 
are the averages of the microscopic properties of all the molecules or 
entities in the system. Any thermodynamic system is therefore treated as a 
continuum that has the same behavior everywhere inside. We assume the 
system is in equilibrium. You could have, for example, a temperature 
gradient across the system. However, when we discuss a thermodynamic 
system in this chapter, we study those that have uniform properties 
throughout the system. 


Before we can carry out any study on a thermodynamic system, we need a 
fundamental characterization of the system. When we studied a mechanical 
system, we focused on the forces and torques on the system, and their 
balances dictated the mechanical equilibrium of the system. In a similar 
way, we should examine the heat transfer between a thermodynamic system 
and its environment or between the different parts of the system, and its 
balance should dictate the thermal equilibrium of the system. Intuitively, 
such a balance is reached if the temperature becomes the same for different 
objects or parts of the system in thermal contact, and the net heat transfer 
over time becomes zero. 


Thus, when we say two objects (a thermodynamic system and its 
environment, for example) are in thermal equilibrium, we mean that they 
are at the same temperature, as we discussed in Temperature and Heat. Let 
us consider three objects at temperatures 7’) JY’ and 7’ respectively. How 
do we know whether they are in thermal equilibrium? The governing 
principle here is the zeroth law of thermodynamics, as described in 
Temperature and Heat on temperature and heat: 


If object 1 is in thermal equilibrium with objects 2 and 3, respectively, then 
objects 2 and 3 must also be in thermal equilibrium. 


Mathematically, we can simply write the zeroth law of thermodynamics as 
Equation: 


T T T T T T 


This is the most fundamental way of defining temperature: Two objects 
must be at the same temperature thermodynamically if the net heat transfer 
between them is zero when they are put in thermal contact and have reached 
a thermal equilibrium. 


The zeroth law of thermodynamics is equally applicable to the different 
parts of a closed system and requires that the temperature everywhere inside 
the system be the same if the system has reached a thermal equilibrium. To 
simplify our discussion, we assume the system is uniform with only one 
type of material—for example, water in a tank. The measurable properties 
of the system at least include its volume, pressure, and temperature. The 
range of specific relevant variables depends upon the system. For example, 
for a stretched rubber band, the relevant variables would be length, tension, 
and temperature. The relationship between these three basic properties of 
the system is called the equation of state of the system and is written 
symbolically for a closed system as 


Note: 
Equation: 


pea as 


where V, p, and T are the volume, pressure, and temperature of the system 
at a given condition. 


In principle, this equation of state exists for any thermodynamic system but 
is not always readily available. The forms of f p V T for many 
materials have been determined either experimentally or theoretically. In the 
preceding chapter, we saw an example of an equation of state for an ideal 
gas, fpVT pV nRT 


We have so far introduced several physical properties that are relevant to 
the thermodynamics of a thermodynamic system, such as its volume, 


pressure, and temperature. We can separate these quantities into two generic 
categories. The quantity associated with an amount of matter is an 
extensive variable, such as the volume and the number of moles. The other 
properties of a system are intensive variables, such as the pressure and 
temperature. An extensive variable doubles its value if the amount of matter 
in the system doubles, provided all the intensive variables remain the same. 
For example, the volume or total energy of the system doubles if we double 
the amount of matter in the system while holding the temperature and 
pressure of the system unchanged. 


Summary 


e A thermodynamic system, its boundary, and its surroundings must be 
defined with all the roles of the components fully explained before we 
can analyze a situation. 

e Thermal equilibrium is reached with two objects if a third object is in 
thermal equilibrium with the other two separately. 

e A general equation of state for a closed system has the form 
fpVT with an ideal gas as an illustrative example. 


Conceptual Questions 


Exercise: 


Problem: 


Consider these scenarios and state whether work is done by the system 
on the environment (SE) or by the environment on the system (ES): (a) 
opening a carbonated beverage; (b) filling a flat tire; (c) a sealed empty 
gas can expands on a hot day, bowing out the walls. 


Solution: 


a. SE; b. ES; c. ES 


Problems 


Exercise: 


Problem: 


A gas follows pV bp cr onan isothermal curve, where p is the 
pressure, V is the volume, b is a constant, and c is a function of 
temperature. Show that a temperature scale under an isochoric process 
can be established with this gas and is identical to that of an ideal gas. 


Solution: 
pV 0b cr is the temperature scale desired and mirrors the ideal 
gas if under constant volume. 
Exercise: 
Problem: 


A mole of gas has isobaric expansion coefficient dV dZ’ R pand 
isochoric pressure-temperature coefficient dp dT’ p T. Find the 
equation of state of the gas. 


Exercise: 
Problem: 
Find the equation of state of a solid that has an isobaric expansion 


coefficientdV dT cI’ bp and an isothermal pressure-volume 
coefficient dV dp bT 


Solution: 


VY bof ef 


Glossary 


boundary 
imagined walls that separate the system and its surroundings 


closed system 


system that is mechanically and thermally isolated from its 
environment 


environment 
outside of the system being studied 


equation of state 
describes properties of matter under given physical conditions 


equilibrium 
thermal balance established between two objects or parts within a 
system 


extensive variable 
variable that is proportional to the amount of matter in the system 


intensive variable 
variable that is independent of the amount of matter in the system 


open system 
system that can exchange energy and/or matter with its surroundings 


surroundings 
environment that interacts with an open system 


thermodynamic system 
object and focus of thermodynamic study 


Work, Heat, and Internal Energy 
By the end of this section, you will be able to: 


e Describe the work done by a system, heat transfer between objects, 
and internal energy change of a system 

¢ Calculate the work, heat transfer, and internal energy change in a 
simple process 


We discussed the concepts of work and energy earlier in mechanics. 
Examples and related issues of heat transfer between different objects have 
also been discussed in the preceding chapters. Here, we want to expand 
these concepts to a thermodynamic system and its environment. 
Specifically, we elaborated on the concepts of heat and heat transfer in the 
previous two chapters. Here, we want to understand how work is done by or 
to a thermodynamic system; how heat is transferred between a system and 
its environment; and how the total energy of the system changes under the 
influence of the work done and heat transfer. 


Work Done by a System 


A force created from any source can do work by moving an object through 
a displacement. Then how does a thermodynamic system do work? [link] 
shows a gas confined to a cylinder that has a movable piston at one end. If 
the gas expands against the piston, it exerts a force through a distance and 
does work on the piston. If the piston compresses the gas as it is moved 
inward, work is also done—in this case, on the gas. The work associated 
with such volume changes can be determined as follows: Let the gas 
pressure on the piston face be p. Then the force on the piston due to the gas 
is pA, where A is the area of the face. When the piston is pushed outward an 
infinitesimal distance dx, the magnitude of the work done by the gas is 
Equation: 


dW = F dz = pAdz. 


Since the change in volume of the gas is dV = A dz, this becomes 
Equation: 


dW = pdV. 


For a finite change in volume from Vj to V2, we can integrate this equation 
from V, to V2 to find the net work: 


Note: 
Equation: 


The work done by a confined gas in 
moving a piston a distance dx is given by 
dW = Fdz = pdV. 


This integral is only meaningful for a quasi-static process, which means a 
process that takes place in infinitesimally small steps, keeping the system at 


thermal equilibrium. (We examine this idea in more detail later in this 
chapter.) Only then does a well-defined mathematical relationship (the 
equation of state) exist between the pressure and volume. This relationship 
can be plotted on a pV diagram of pressure versus volume, where the curve 
is the change of state. We can approximate such a process as one that occurs 
slowly, through a series of equilibrium states. The integral is interpreted 
graphically as the area under the pV curve (the shaded area of [link]). Work 
done by the gas is positive for expansion and negative for compression. 


When a gas expands slowly from 
V, to V2, the work done by the system is 
represented by the shaded area under the 
pV curve. 


Consider the two processes involving an ideal gas that are represented by 
paths AC and ABC in [link]. The first process is an isothermal expansion, 
with the volume of the gas changing its volume from V,; to V2. This 
isothermal process is represented by the curve between points A and C. The 
gas is kept at a constant temperature T by keeping it in thermal equilibrium 
with a heat reservoir at that temperature. From [link] and the ideal gas law, 


Equation: 


Vy V2 : 


The paths ABC, AC, and ADC represent 
three different quasi-static transitions 
between the equilibrium states A and C. 


The expansion is isothermal, so T remains constant over the entire process. 
Since n and R are also constant, the only variable in the integrand is V, so 


the work done by an ideal gas in an isothermal process is 
Equation: 


V2 
W= ne | oe = nRTIn~2. 
ye Vv Vi 


Notice that if V2 > V; (expansion), W is positive, as expected. 


The straight lines from A to B and then from B to C represent a different 
process. Here, a gas at a pressure p first expands isobarically (constant 
pressure) and quasi-statically from V; to V2, after which it cools quasi- 
statically at the constant volume V2 until its pressure drops to p2. From A to 
B, the pressure is constant at p, so the work over this part of the path is 
Equation: 


Vo Vo 
W= paV =p, | dV = p1(V, — Vj). 
Vi Vi 


From B to C, there is no change in volume and therefore no work is done. 
The net work over the path ABC is then 
Equation: 


W = pi (V2 — Vi) + 0 = pi (V2 — Vi). 


A comparison of the expressions for the work done by the gas in the two 
processes of [link] shows that they are quite different. This illustrates a very 
important property of thermodynamic work: It is path dependent. We 
cannot determine the work done by a system as it goes from one 
equilibrium state to another unless we know its thermodynamic path. 
Different values of the work are associated with different paths. 


Example: 

Isothermal Expansion of a van der Waals Gas 

Studies of a van der Waals gas require an adjustment to the ideal gas law 
that takes into consideration that gas molecules have a definite volume (see 
The Kinetic Theory of Gases). One mole of a van der Waals gas has an 
equation of state 

Equation: 


a 
(oe eee 
where a and b are two parameters for a specific gas. Suppose the gas 
expands isothermally and quasi-statically from volume V; to volume V3. 
How much work is done by the gas during the expansion? 

Strategy 

Because the equation of state is given, we can use [link] to express the 
pressure in terms of V and TJ. Furthermore, temperature T is a constant 
under the isothermal condition, so V becomes the only changing variable 
under the integral. 

Solution 

To evaluate this integral, we must express p as a function of V. From the 
given equation of state, the gas pressure is 

Equation: 


Because T is constant under the isothermal condition, the work done by 1 
mol of a van der Waals gas in expanding from a volume Vj to a volume V2 
is thus 

Equation: 


i 


V2 
i 
W = (Fo) |RTIn(V — b) + #|y 


V2—b 1 1 
= RTIn (>) +a(¢ a x). 


Significance 

By taking into account the volume of molecules, the expression for work is 
much more complex. If, however, we set a = 0 and b = 0, we see that the 
expression for work matches exactly the work done by an isothermal 
process for one mole of an ideal gas. 


Note: 
Exercise: 


Problem: 


Check Your Understanding How much work is done by the gas, as 
given in [link], when it expands quasi-statically along the path ADC? 


Solution: 


p2(V2 — V1) 


Internal Energy 


The internal energy F;,; of a thermodynamic system is, by definition, the 
sum of the mechanical energies of all the molecules or entities in the 
system. If the kinetic and potential energies of molecule i are K; and U;, 
respectively, then the internal energy of the system is the average of the 
total mechanical energy of all the entities: 


Note: 
Equation: 


Ein, = So(K: 4: Ui), 


4 


where the summation is over all the molecules of the system, and the bars 
over K and U indicate average values. The kinetic energy AK; of an 
individual molecule includes contributions due to its rotation and vibration, 
as well as its translational energy mv? /2, where v; is the molecule’s speed 
measured relative to the center of mass of the system. The potential energy 


U; is associated only with the interactions between molecule i and the other 
molecules of the system. In fact, neither the system’s location nor its motion 
is of any consequence as far as the internal energy is concerned. The 
internal energy of the system is not affected by moving it from the 
basement to the roof of a 100-story building or by placing it on a moving 
train. 


In an ideal monatomic gas, each molecule is a single atom. Consequently, 
there is no rotational or vibrational kinetic energy and kK; = miv? / 2. 
Furthermore, there are no interatomic interactions (collisions 
notwithstanding), so U; = constant, which we set to zero. The internal 
energy is therefore due to translational kinetic energy only and 
Equation: 


Ein = S* Ki = >, mint. 
i i 


From the discussion in the preceding chapter, we know that the average 
kinetic energy of a molecule in an ideal monatomic gas is 
Equation: 


where T is the Kelvin temperature of the gas. Consequently, the average 
mechanical energy per molecule of an ideal monatomic gas is also 3kgT'/2, 
that is, 

Equation: 


este. & 3 
K,+U; = K; = Pu 
The internal energy is just the number of molecules multiplied by the 


average mechanical energy per molecule. Thus for n moles of an ideal 
monatomic gas, 


Note: 
Equation: 


3 
Fin, = 2NA (#07) = <n RT. 


Notice that the internal energy of a given quantity of an ideal monatomic 
gas depends on just the temperature and is completely independent of the 
pressure and volume of the gas. For other systems, the internal energy 
cannot be expressed so simply. However, an increase in internal energy can 
often be associated with an increase in temperature. 


We know from the zeroth law of thermodynamics that when two systems 
are placed in thermal contact, they eventually reach thermal equilibrium, at 
which point they are at the same temperature. As an example, suppose we 
mix two monatomic ideal gases. Now, the energy per molecule of an ideal 
monatomic gas is proportional to its temperature. Thus, when the two gases 
are mixed, the molecules of the hotter gas must lose energy and the 
molecules of the colder gas must gain energy. This continues until thermal 
equilibrium is reached, at which point, the temperature, and therefore the 
average translational kinetic energy per molecule, is the same for both 
gases. The approach to equilibrium for real systems is somewhat more 
complicated than for an ideal monatomic gas. Nevertheless, we can still say 
that energy is exchanged between the systems until their temperatures are 
the same. 


Summary 


e Positive (negative) work is done by a thermodynamic system when it 
expands (contracts) under an external pressure. 

¢ Heat is the energy transferred between two objects (or two parts of a 
system) because of a temperature difference. 

e Internal energy of a thermodynamic system is its total mechanical 
energy. 


Conceptual Questions 


Exercise: 
Problem: 
Is it possible to determine whether a change in internal energy is 


caused by heat transferred, by work performed, or by a combination of 
the two? 


Exercise: 
Problem: 


When a liquid is vaporized, its change in internal energy is not equal to 
the heat added. Why? 


Solution: 


Some of the energy goes into changing the phase of the liquid to gas. 
Exercise: 


Problem: 


Why does a bicycle pump feel warm as you inflate your tire? 
Exercise: 
Problem: 


Is it possible for the temperature of a system to remain constant when 
heat flows into or out of it? If so, give examples. 


Solution: 


Yes, as long as the work done equals the heat added there will be no 
change in internal energy and thereby no change in temperature. When 
water freezes or when ice melts while removing or adding heat, 
respectively, the temperature remains constant. 


Problems 


Exercise: 
Problem: 
A gas at a pressure of 2.00 atm undergoes a quasi-static isobaric 
expansion from 3.00 to 5.00 L. How much work is done by the gas? 
Exercise: 
Problem: 
It takes 500 J of work to compress quasi-statically 0.50 mol of an ideal 


gas to one-fifth its original volume. Calculate the temperature of the 
gas, assuming it remains constant during the compression. 


Solution: 


74K 
Exercise: 
Problem: 
It is found that, when a dilute gas expands quasi-statically from 0.50 to 


4.0 L, it does 250 J of work. Assuming that the gas temperature 
remains constant at 300 K, how many moles of gas are present? 


Exercise: 
Problem: 
In a quasi-static isobaric expansion, 500 J of work are done by the gas. 


If the gas pressure is 0.80 atm, what is the fractional increase in the 
volume of the gas, assuming it was originally at 20.0 L? 


Solution: 


1.4 times 


Exercise: 


Problem: 


When a gas undergoes a quasi-static isobaric change in volume from 
10.0 to 2.0 L, 15 J of work from an external source are required. What 
is the pressure of the gas? 


Exercise: 


Problem: 


An ideal gas expands quasi-statically and isothermally from a state 
with pressure p and volume V to a state with volume 4V. Show that the 
work done by the gas in the expansion is pV(In 4). 


Solution: 


pVIn(4) 
Exercise: 


Problem: 


As shown below, calculate the work done by the gas in the quasi-static 
processes represented by the paths (a) AB; (b) ADB; (c) ACB; and (d) 
ADCB. 


p (atm) 
4.0 
3.0 
2.0 


1.0 


Exercise: 


Problem: 


(a) Calculate the work done on the gas along the closed path shown 
below. The curved section between R and S is semicircular. (b) If the 
process is carried out in the opposite direction, what is the work done 
on the gas? 


p (atm) 
4.0 


3.0 


10 20 30 40 50) 


Solution: 


a. 160 J; b. -160 J 
Exercise: 
Problem: 
An ideal gas expands quasi-statically to three times its original 
volume. Which process requires more work from the gas, an 


isothermal process or an isobaric one? Determine the ratio of the work 
done in these processes. 


Exercise: 


Problem: 


A dilute gas at a pressure of 2.0 atm and a volume of 4.0 L is taken 
through the following quasi-static steps: (a) an isobaric expansion to a 
volume of 10.0 L, (b) an isochoric change to a pressure of 0.50 atm, 
(c) an isobaric compression to a volume of 4.0 L, and (d) an isochoric 
change to a pressure of 2.0 atm. Show these steps on a pV diagram and 
determine from your graph the net work done by the gas. 


Solution: 


W = 900 J 


Exercise: 


Problem: 


What is the average mechanical energy of the atoms of an ideal 
monatomic gas at 300 K? 


Exercise: 


Problem: 


What is the internal energy of 6.00 mol of an ideal monatomic gas at 
200 “C? 


Solution: 


3.53 «10° J 
Exercise: 
Problem: 
Calculate the internal energy of 15 mg of helium at a temperature of 
0°C. 
Exercise: 
Problem: 
Two monatomic ideal gases A and B are at the same temperature. If 
1.0 g of gas A has the same internal energy as 0.10 g of gas B, what 


are (a) the ratio of the number of moles of each gas and (b) the ration 
of the atomic masses of the two gases? 


Solution: 


a. 1:1; b. 10:1 
Exercise: 


Problem: 


The van der Waals coefficients for oxygen are a = 0.138 J - m?® i mol? 
and b = 3.18 x 10~° m*/mol. Use these values to draw a van der 
Waals isotherm of oxygen at 100 K. On the same graph, draw 
isotherms of one mole of an ideal gas. 


Exercise: 


Problem: 


Find the work done in the quasi-static processes shown below. The 
States are given as (p, V) values for the points in the pV plane: 1 (3 
atm, 4 L), 2 (3 atm, 6 L), 3 (5 atm, 4 L), 4 (2 atm, 6 L), 5 (4 atm, 2 L), 
6 (5 atm, 5 L), and 7 (2 atm, 5 L). 


< 


(a) (b) 


(C) (d) 


> 
< 


< 


(e) (f) 


Solution: 


a. 600 J; b. 0; c. 500 J; d. 200 J; e. 800 J; f. 500 J 


s 


s 


s 


Glossary 


internal energy 
average of the total mechanical energy of all the molecules or entities 
in the system 


quasi-static process 
evolution of a system that goes so slowly that the system involved is 
always in thermodynamic equilibrium 


First Law of Thermodynamics 
By the end of this section, you will be able to: 


e State the first law of thermodynamics and explain how it is applied 
e Explain how heat transfer, work done, and internal energy change are 
related in any thermodynamic process 


Now that we have seen how to calculate internal energy, heat, and work done 
for a thermodynamic system undergoing change during some process, we 
can see how these quantities interact to affect the amount of change that can 
occur. This interaction is given by the first law of thermodynamics. British 
scientist and novelist C. P. Snow (1905-1980) is credited with a joke about 
the four laws of thermodynamics. His humorous statement of the first law of 
thermodynamics is stated “you can’t win,” or in other words, you cannot get 
more energy out of a system than you put into it. We will see in this chapter 
how internal energy, heat, and work all play a role in the first law of 
thermodynamics. 


Suppose Q represents the heat exchanged between a system and the 
environment, and W is the work done by or on the system. The first law 
states that the change in internal energy of that system is given by Q — W. 
Since added heat increases the internal energy of a system, Q is positive 
when it is added to the system and negative when it is removed from the 
system. 


When a gas expands, it does work and its internal energy decreases. Thus, W 
is positive when work is done by the system and negative when work is done 
on the system. This sign convention is summarized in [link]. The first law of 
thermodynamics is stated as follows: 


Note: 

First Law of Thermodynamics 

Associated with every equilibrium state of a system is its internal energy 
Fiin,. The change in Fy, for any transition between two equilibrium states is 
Equation: 


A Fint a Q —W 


where Q and W represent, respectively, the heat exchanged by the system 
and the work done by or on the system. 


Thermodynamic Sign Conventions for Heat and Work 


Process Convention 
Heat added to system Q>0 
Heat removed from system Q <0 
Work done by system W>0 
Work done on system W <0 


The first law is a statement of energy conservation. It tells us that a system 
can exchange energy with its surroundings by the transmission of heat and by 
the performance of work. The net energy exchanged is then equal to the 
change in the total mechanical energy of the molecules of the system (i.e., 
the system’s internal energy). Thus, if a system is isolated, its internal energy 
must remain constant. 


Although Q and W both depend on the thermodynamic path taken between 
two equilibrium states, their difference Q — W does not. [link] shows the pV 
diagram of a system that is making the transition from A to B repeatedly 
along different thermodynamic paths. Along path 1, the system absorbs heat 
Q, and does work Wj; along path 2, it absorbs heat @» and does work Ws, 
and so on. The values of @; and W; may vary from path to path, but we have 
Equation: 


Qi1-W1 =Q2-W2=::-=Q:-Wi=-:-:, 


or 
Equation: 


AFinti = AF ing = ++: = AFing =---. 


That is, the change in the internal energy of the system between A and B is 
path independent. In the chapter on potential energy and the conservation of 
energy, we encountered another path-independent quantity: the change in 
potential energy between two arbitrary points in space. This change 
represents the negative of the work done by a conservative force between the 
two points. The potential energy is a function of spatial coordinates, whereas 
the internal energy is a function of thermodynamic variables. For example, 
we might write Fin,(T, p) for the internal energy. Functions such as internal 
energy and potential energy are known as state functions because their values 
depend solely on the state of the system. 

p 


Different thermodynamic paths taken by 
a system in going from state A to state B. 
For all transitions, the change in the 
internal energy of the system 
AE int = Q — W is the same. 


Often the first law is used in its differential form, which is 
Equation: 


dE. — dQ — dW. 


Here dix is an infinitesimal change in internal energy when an 
infinitesimal amount of heat dQ is exchanged with the system and an 
infinitesimal amount of work dW is done by (positive in sign) or on (negative 
in sign) the system. 


Example: 

Changes of State and the First Law 

During a thermodynamic process, a system moves from state A to state B, it 
is supplied with 400 J of heat and does 100 J of work. (a) For this transition, 
what is the system’s change in internal energy? (b) If the system then moves 
from state B back to state A, what is its change in internal energy? (c) If in 
moving from A to B along a different path, W' 43 = 400 J of work is done 
on the system, how much heat does it absorb? 

Strategy 

The first law of thermodynamics relates the internal energy change, work 
done by the system, and the heat transferred to the system in a simple 
equation. The internal energy is a function of state and is therefore fixed at 
any given point regardless of how the system reaches the state. 

Solution 


a. From the first law, the change in the system’s internal energy is 
Equation: 


AF intAB = QaBp — Wap = 400 J — 100 J = 300 J. 


b. Consider a closed path that passes through the states A and B. Internal 
energy is a state function, so Aint is zero for a closed path. Thus 
Equation: 


AEF nt = A Fint AB ae AF intBA = 0, 


and 
Equation: 


AF int AB = —AFptBA : 


This yields 
Equation: 


AFi+pa = —300 J. 


c. The change in internal energy is the same for any path, so 
Equation: 


AE Ap = AE mAp = @ dp-W Ap; 
300J = Q'4p~(—400 J), 


and the heat exchanged is 
Equation: 


On = A008 


The negative sign indicates that the system loses heat in this transition. 


Significance 

When a closed cycle is considered for the first law of thermodynamics, the 
change in internal energy around the whole path is equal to zero. If friction 
were to play a role in this example, less work would result from this heat 
added. [link] takes into consideration what happens if friction plays a role. 


Notice that in [link], we did not assume that the transitions were quasi-static. 
This is because the first law is not subject to such a restriction. It describes 
transitions between equilibrium states but is not concerned with the 
intermediate states. The system does not have to pass through only 


equilibrium states. For example, if a gas in a steel container at a well-defined 
temperature and pressure is made to explode by means of a spark, some of 
the gas may condense, different gas molecules may combine to form new 
compounds, and there may be all sorts of turbulence in the container—but 
eventually, the system will settle down to a new equilibrium state. This 
system is clearly not in equilibrium during its transition; however, its 
behavior is still governed by the first law because the process starts and ends 
with the system in equilibrium states. 


Example: 

Polishing a Fitting 

A machinist polishes a 0.50-kg copper fitting with a piece of emery cloth for 
2.0 min. He moves the cloth across the fitting at a constant speed of 1.0 m/s 
by applying a force of 20 N, tangent to the surface of the fitting. (a) What is 
the total work done on the fitting by the machinist? (b) What is the increase 
in the internal energy of the fitting? Assume that the change in the internal 
energy of the cloth is negligible and that no heat is exchanged between the 
fitting and its environment. (c) What is the increase in the temperature of the 
fitting? 

Strategy 

The machinist’s force over a distance that can be calculated from the speed 
and time given is the work done on the system. The work, in turn, increases 
the internal energy of the system. This energy can be interpreted as the heat 
that raises the temperature of the system via its heat capacity. Be careful 
with the sign of each quantity. 

Solution 


a. The power created by a force on an object or the rate at which the 


—_ 
machinist does frictional work on the fitting is F V=—Fv. Thus, in 
an elapsed time At (2.0 min), the work done on the fitting is 
Equation: 


W =-—FvAt = —(20N)(1.0m/s)(1.2 x 107s) 
=a 102 4)) 


b. By assumption, no heat is exchanged between the fitting and its 
environment, so the first law gives for the change in the internal energy 
of the fitting: 

Equation: 


AE wt = —W =2.4 x 10° J. 


c. Since A Fin¢ is path independent, the effect of the 2.4 x 10° J of 
work is the same as if it were supplied at atmospheric pressure by a 
transfer of heat. Thus, 

Equation: 


2.4 x 10°J = mcAT = (0.50kg)(3.9 x 10? J/kg -°C)AT, 


and the increase in the temperature of the fitting is 
Equation: 


ING eS C) 


where we have used the value for the specific heat of copper, 
eo Ue M0 ay ke) 


Significance 
If heat were released, the change in internal energy would be less and cause 
less of a temperature change than what was calculated in the problem. 


Note: 
Exercise: 


Problem: 
Check Your Understanding The quantities below represent four 


different transitions between the same initial and final state. Fill in the 
blanks. 


Q (J) W (J) AEF in. (J) 


—80 —120 
90 
40 
—40 
Solution: 


Line 1, AFin, = 40 J; line 2, W = 50J and AF, = 40 J; line 3, 
Q = 80J and AF;,;, = 40 J; and line 4, Q = 0 and AE, = 40 J 


Example: 

An Ideal Gas Making Transitions between Two States 

Consider the quasi-static expansions of an ideal gas between the equilibrium 
states A and C of [link]. If 515 J of heat are added to the gas as it traverses 
the path ABC, how much heat is required for the transition along ADC? 
Assume that 

p1 = 2.10 x 10°N/m?,p) = 1.05 x 10°N/m?, V; = 2.25 x 107? m3, 
and V2 = 4.50 x 10°? m?. 

Strategy 

The difference in work done between process ABC and process ADC is the 
area enclosed by ABCD. Because the change of the internal energy (a 
function of state) is the same for both processes, the difference in work is 
thus the same as the difference in heat transferred to the system. 

Solution 

For path ABC, the heat added is Q 48c = 515 J and the work done by the 
gas is the area under the path on the pV diagram, which is 

Equation: 


Wasc = pi(V2 — Vi) = 473 J. 


Along ADC, the work done by the gas is again the area under the path: 
Equation: 


Wapc = p2(V2 — Vi) = 236 J. 


Then using the strategy we just described, we have 
Equation: 


Qapc — Qasc = Wapc — Wazc, 


which leads to 
Equation: 


Qapc = Qasc + Wapc — Wapc = (515 + 236 — 473) J = 278 J. 


Significance 

The work calculations in this problem are made simple since no work is 
done along AD and BC and along AB and DC; the pressure is constant over 
the volume change, so the work done is simply pAV. An isothermal line 
could also have been used, as we have derived the work for an isothermal 
process as W = nRTIn+?. 


Example: 

Isothermal Expansion of an Ideal Gas 

Heat is added to 1 mol of an ideal monatomic gas confined to a cylinder 
with a movable piston at one end. The gas expands quasi-statically at a 
constant temperature of 300 K until its volume increases from V to 3V. (a) 
What is the change in internal energy of the gas? (b) How much work does 
the gas do? (c) How much heat is added to the gas? 

Strategy 

(a) Because the system is an ideal gas, the internal energy only changes 
when the temperature changes. (b) The heat added to the system is therefore 
purely used to do work that has been calculated in Work, Heat, and Internal 
Energy. (c) Lastly, the first law of thermodynamics can be used to calculate 
the heat added to the gas. 


Solution 


a. We saw in the preceding section that the internal energy of an ideal 
monatomic gas is a function only of temperature. Since AT’ = 0, for 
this process, AFin, = 0. 

b. The quasi-static isothermal expansion of an ideal gas was considered in 
the preceding section and was found to be 
Equation: 


W =nkTiny? =nRkTnsy 
= (1.00 mol) (8.314 J/K - mol)(300 K)(In3) = 2.74 x 10° J. 
c. With the results of parts (a) and (b), we can use the first law to 


determine the heat added: 
Equation: 


ey 


which leads to 
Equation: 


Q=W =2.74 x 10° J. 


Significance 
An isothermal process has no change in the internal energy. Based on that, 
the first law of thermodynamics reduces to Q = W. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Why was it necessary to state that the 
process of [link] is quasi-static? 


Solution: 


So that the process is represented by the curve p = nRT/V on the pV 
plot for the evaluation of work. 


Example: 

Vaporizing Water 

When 1.00 g of water at 100 “C changes from the liquid to the gas phase at 
atmospheric pressure, its change in volume is 1.67 x 10-? m?. (a) How 
much heat must be added to vaporize the water? (b) How much work is 
done by the water against the atmosphere in its expansion? (c) What is the 
change in the internal energy of the water? 

Strategy 

We can first figure out how much heat is needed from the latent heat of 
vaporization of the water. From the volume change, we can calculate the 
work done from W = pAV because the pressure is constant. Then, the first 
law of thermodynamics provides us with the change in the internal energy. 
Solution 


a. With L, representing the latent heat of vaporization, the heat required 
to vaporize the water is 
Equation: 


Oj nk (00 (226. 107 Ie 2 G10 J 


b. Since the pressure on the system is constant at 
1.00 atm = 1.01 x 10° N/m’, the work done by the water as it is 
vaporized is 
Equation: 


W = AVS (Ol 10 Nim 6710 ms) — 169 J. 


c. From the first law, the thermal energy of the water during its 
vaporization changes by 
Equation: 


INDO) SY = OO TE A SO 


Significance 

We note that in part (c), we see a change in internal energy, yet there is no 
change in temperature. Ideal gases that are not undergoing phase changes 
have the internal energy proportional to temperature. Internal energy in 
general is the sum of all energy in the system. 


Note: 
Exercise: 


Problem: 


Check Your Understanding When 1.00 g of ammonia boils at 
atmospheric pressure and —33.0 °C, its volume changes from 1.47 to 
1130 cm‘®. Its heat of vaporization at this pressure is 

1.37 x 10° J/kg. What is the change in the internal energy of the 
ammonia when it vaporizes? 


Solution: 


1.26 x 10° J. 


Note: 

View this site to learn about how the first law of thermodynamics. First, 
pump some heavy species molecules into the chamber. Then, play around by 
doing work (pushing the wall to the right where the person is located) to see 
how the internal energy changes (as seen by temperature). Then, look at 
how heat added changes the internal energy. Finally, you can set a parameter 
constant such as temperature and see what happens when you do work to 
keep the temperature constant (Note: You might see a change in these 
variables initially if you are moving around quickly in the simulation, but 
ultimately, this value will return to its equilibrium value). 


Summary 


e The internal energy of a thermodynamic system is a function of state 
and thus is unique for every equilibrium state of the system. 

e The increase in the internal energy of the thermodynamic system is 
given by the heat added to the system less the work done by the system 
in any thermodynamics process. 


Conceptual Questions 


Exercise: 
Problem: 
What does the first law of thermodynamics tell us about the energy of 
the universe? 

Exercise: 


Problem: 


Does adding heat to a system always increase its internal energy? 


Solution: 


If more work is done on the system than heat added, the internal energy 
of the system will actually decrease. 


Exercise: 


Problem: 


A great deal of effort, time, and money has been spent in the quest for a 
so-called perpetual-motion machine, which is defined as a hypothetical 
machine that operates or produces useful work indefinitely and/or a 
hypothetical machine that produces more work or energy than it 
consumes. Explain, in terms of the first law of thermodynamics, why or 
why not such a machine is likely to be constructed. 


Problems 


Exercise: 
Problem: 
When a dilute gas expands quasi-statically from 0.50 to 4.0 L, it does 
250 J of work. Assuming that the gas temperature remains constant at 


300 K, (a) what is the change in the internal energy of the gas? (b) How 
much heat is absorbed by the gas in this process? 


Exercise: 
Problem: 
In an expansion of gas, 500 J of work are done by the gas. If the internal 


energy of the gas increased by 80 J in the expansion, how much heat 
does the gas absorb? 


Solution: 


580 J 
Exercise: 
Problem: 
An ideal gas expands quasi-statically and isothermally from a state with 


pressure p and volume V to a state with volume 4V. How much heat is 
added to the expanding gas? 


Exercise: 
Problem: 
As shown below, if the heat absorbed by the gas along AB is 400 J, 


determine the quantities of heat absorbed along (a) ADB; (b) ACB; and 
(c) ADCB. 


Solution: 


a. 600 J; b. 600 J; c. 800 J 
Exercise: 


Problem: 


During the isobaric expansion from A to B represented below, 3,100 J 
of heat are added to the gas. What is the change in its internal energy? 


p (Nim?) 


1.0 x 104 A=-—h— 8 


0.15 0.30 V(m%) 


Exercise: 


Problem: 


(a) What is the change in internal energy for the process represented by 
the closed path shown below? (b) How much heat is exchanged? (c) If 
the path is traversed in the opposite direction, how much heat is 
exchanged? 


p (atm) 


4.0 


3.0 
~ max 
1.0 R S 
10 20 30 40 50V() 


Solution: 


a. 0; b. 160 J; c. -160 J 
Exercise: 


Problem: 


When a gas expands along path AC shown below, it does 400 J of work 
and absorbs either 200 or 400 J of heat. (a) Suppose you are told that 
along path ABC, the gas absorbs either 200 or 400 J of heat. Which of 
these values is correct? (b) Give the correct answer from part (a), how 
much work is done by the gas along ABC? (c) Along CD, the internal 
energy of the gas decreases by 50 J. How much heat is exchanged by 
the gas along this path? 


Exercise: 


Problem: 


When a gas expands along AB (see below), it does 500 J of work and 
absorbs 250 J of heat. When the gas expands along AC, it does 400 J of 
work and absorbs 300 J of heat. (a) How much heat does the gas 
exchange along BC? (b) When the gas makes the transition from C to A 
along CDA, 800 J of work are done on it from C to D. How much heat 
does it exchange along CDA? 


p 


Solution: 


a. 150 J; b. 700 J 
Exercise: 


Problem: 


A dilute gas is stored in the left chamber of a container whose walls are 
perfectly insulating (see below), and the right chamber is evacuated. 
When the partition is removed, the gas expands and fills the entire 
container. Calculate the work done by the gas. Does the internal energy 
of the gas change in this process? 


Exercise: 


Problem: 


Ideal gases A and B are stored in the left and right chambers of an 
insulated container, as shown below. The partition is removed and the 
gases mix. Is any work done in this process? If the temperatures of A 
and B are initially equal, what happens to their common temperature 
after they are mixed? 


Solution: 


No work is done and they reach the same common temperature. 


Exercise: 


Problem: 


An ideal monatomic gas at a pressure of 2.0 x 10° N/ m’ anda 
temperature of 300 K undergoes a quasi-static isobaric expansion from 
2.0 x 10° to4.0 x 10° cm’. (a) What is the work done by the gas? 
(b) What is the temperature of the gas after the expansion? (c) How 
many moles of gas are there? (d) What is the change in internal energy 
of the gas? (e) How much heat is added to the gas? 


Exercise: 


Problem: 


Consider the process for steam in a cylinder shown below. Suppose the 
change in the internal energy in this process is 30 kJ. Find the heat 
entering the system. 


p (atm) 


50 


20 


5 12 Vil) 


Solution: 


54,500 J 
Exercise: 


Problem: 


The state of 30 moles of steam in a cylinder is changed in a cyclic 
manner from a-b-c-a, where the pressure and volume of the states are: a 
(30 atm, 20 L), b (50 atm, 20 L), and c (50 atm, 45 L). Assume each 
change takes place along the line connecting the initial and final states 
in the pV plane. (a) Display the cycle in the pV plane. (b) Find the net 
work done by the steam in one cycle. (c) Find the net amount of heat 
flow in the steam over the course of one cycle. 


Exercise: 


Problem: 


A monatomic ideal gas undergoes a quasi-static process that is 
described by the function p(V) = pi + 3(V — Vi), where the starting 
state is (p;, V,) and the final state (p2, V2). Assume the system consists 
of n moles of the gas in a container that can exchange heat with the 
environment and whose volume can change freely. (a) Evaluate the 
work done by the gas during the change in the state. (b) Find the change 
in internal energy of the gas. (c) Find the heat input to the gas during the 
change. (d) What are initial and final temperatures? 


Solution: 


d. (p1 a3 3V7) (Va _ Vi) = 3Vi (V3 = V7) ae (Vs _ Ve): b. 
3 (p2V2 — piV1); c. the sum of parts (a) and (b); d. T; = pit and 
T, = bate 


Exercise: 


Problem: 

A metallic container of fixed volume of 2.5 x 10 ~° m? immersed ina 
large tank of temperature 27 “C contains two compartments separated 
by a freely movable wall. Initially, the wall is kept in place by a stopper 
so that there are 0.02 mol of the nitrogen gas on one side and 0.03 mol 
of the oxygen gas on the other side, each occupying half the volume. 
When the stopper is removed, the wall moves and comes to a final 
position. The movement of the wall is controlled so that the wall moves 
in infinitesimal quasi-static steps. (a) Find the final volumes of the two 
sides assuming the ideal gas behavior for the two gases. (b) How much 
work does each gas do on the other? (c) What is the change in the 
internal energy of each gas? (d) Find the amount of heat that enters or 
leaves each gas. 


Exercise: 


Problem: 

A gas in a cylindrical closed container is adiabatically and quasi- 
statically expanded from a state A (3 MPa, 2 L) to a state B with volume 
of 6 L along the path 1.8 pV = constant. (a) Plot the path in the pV 
plane. (b) Find the amount of work done by the gas and the change in 
the internal energy of the gas during the process. 


Solution: 


p (MPa) 


5 VO 
b. W =4.39kJ, AEim, = —4.39 kJ 


Glossary 


first law of thermodynamics 
the change in internal energy for any transition between two equilibrium 
states is AFing = Q —W 


Thermodynamic Processes 
By the end of this section, you will be able to: 


e Define a thermodynamic process 

e Distinguish between quasi-static and non-quasi-static processes 

¢ Calculate physical quantities, such as the heat transferred, work done, 
and internal energy change for isothermal, adiabatic, and cyclical 
thermodynamic processes 


In solving mechanics problems, we isolate the body under consideration, 
analyze the external forces acting on it, and then use Newton’s laws to 
predict its behavior. In thermodynamics, we take a similar approach. We 
start by identifying the part of the universe we wish to study; it is also 
known as our system. (We defined a system at the beginning of this chapter 
as anything whose properties are of interest to us; it can be a single atom or 
the entire Earth.) Once our system is selected, we determine how the 
environment, or surroundings, interact with the system. Finally, with the 
interaction understood, we study the thermal behavior of the system with 
the help of the laws of thermodynamics. 


The thermal behavior of a system is described in terms of thermodynamic 
variables. For an ideal gas, these variables are pressure, volume, 
temperature, and the number of molecules or moles of the gas. Different 
types of systems are generally characterized by different sets of variables. 
For example, the thermodynamic variables for a stretched rubber band are 
tension, length, temperature, and mass. 


The state of a system can change as a result of its interaction with the 
environment. The change in a system can be fast or slow and large or small. 
The manner in which a state of a system can change from an initial state to 
a final state is called a thermodynamic process. For analytical purposes in 
thermodynamics, it is helpful to divide up processes as either quasi-static or 
non-quasi-static, aS we now explain. 


Quasi-static and Non-quasi-static Processes 


A quasi-static process refers to an idealized or imagined process where the 
change in state is made infinitesimally slowly so that at each instant, the 
system can be assumed to be at a thermodynamic equilibrium with itself 
and with the environment. For instance, imagine heating 1 kg of water from 
a temperature 20 °C to 21 °C at a constant pressure of 1 atmosphere. To 
heat the water very slowly, we may imagine placing the container with 
water in a large bath that can be slowly heated such that the temperature of 
the bath can rise infinitesimally slowly from 20 °C to 21 °C. If we put 1 kg 
of water at 20 °C directly into a bath at 21 °C, the temperature of the water 
will rise rapidly to 21 °C in a non-quasi-static way. 


Quasi-static processes are done slowly enough that the system remains at 
thermodynamic equilibrium at each instant, despite the fact that the system 
changes over time. The thermodynamic equilibrium of the system is 
necessary for the system to have well-defined values of macroscopic 
properties such as the temperature and the pressure of the system at each 
instant of the process. Therefore, quasi-static processes can be shown as 
well-defined paths in state space of the system. 


Since quasi-static processes cannot be completely realized for any finite 
change of the system, all processes in nature are non-quasi-static. Examples 
of quasi-static and non-quasi-static processes are shown in [link]. Despite 
the fact that all finite changes must occur essentially non-quasi-statically at 
some stage of the change, we can imagine performing infinitely many 
quasi-static process corresponding to every quasi-static process. Since 
quasi-static processes can be analyzed analytically, we mostly study quasi- 
static processes in this book. We have already seen that in a quasi-static 
process the work by a gas is given by pdV. 


Nonquasi-static 
process _---~~. 
“ 


Pr / \ 


Pi A ————>——_ B 
Quasi-static process 


VA 


Vi 


Quasi-static and non-quasi-static processes 
between states A and B of a gas. In a quasi- 
Static process, the path of the process between 
A and B can be drawn in a state diagram since 
all the states that the system goes through are 
known. In a non-quasi-static process, the states 
between A and B are not known, and hence no 
path can be drawn. It may follow the dashed 
line as shown in the figure or take a very 
different path. 


Isothermal Processes 


An isothermal process is a change in the state of the system at a constant 
temperature. This process is accomplished by keeping the system in thermal 
equilibrium with a large heat bath during the process. Recall that a heat bath 
is an idealized “infinitely” large system whose temperature does not 
change. In practice, the temperature of a finite bath is controlled by either 
adding or removing a finite amount of energy as the case may be. 


As an illustration of an isothermal process, consider a cylinder of gas with a 
movable piston immersed in a large water tank whose temperature is 
maintained constant. Since the piston is freely movable, the pressure inside 
P, is balanced by the pressure outside P,y, by some weights on the piston, 
as in [link]. 


Expanding a system at a constant temperature. Removing weights on 
the piston leads to an imbalance of forces on the piston, which causes 
the piston to move up. As the piston moves up, the temperature is 
lowered momentarily, which causes heat to flow from the heat bath to 
the system. The energy to move the piston eventually comes from the 
heat bath. 


As weights on the piston are removed, an imbalance of forces on the piston 
develops. The net nonzero force on the piston would cause the piston to 
accelerate, resulting in an increase in volume. The expansion of the gas 
cools the gas to a lower temperature, which makes it possible for the heat to 
enter from the heat bath into the system until the temperature of the gas is 
reset to the temperature of the heat bath. If weights are removed in 
infinitesimal steps, the pressure in the system decreases infinitesimally 
slowly. This way, an isothermal process can be conducted quasi-statically. 
An isothermal line on a (p, V) diagram is represented by a curved line from 


starting point A to finishing point B, as seen in [link]. For an ideal gas, an 
isothermal process is hyperbolic, since for an ideal gas at constant 
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An isothermal 
expansion from a state 
labeled A to another 
state labeled B on a pV 
diagram. The curve 
represents the relation 
between pressure and 
volume in an ideal gas 
at constant temperature. 


An isothermal process studied in this chapter is quasi-statically performed, 
since to be isothermal throughout the change of volume, you must be able 
to state the temperature of the system at each step, which is possible only if 
the system is in thermal equilibrium continuously. The system must go out 
of equilibrium for the state to change, but for quasi-static processes, we 
imagine that the process is conducted in infinitesimal steps such that these 
departures from equilibrium can be made as brief and as small as we like. 


Other quasi-static processes of interest for gases are isobaric and isochoric 
processes. An isobaric process is a process where the pressure of the 
system does not change, whereas an isochoric process is a process where 
the volume of the system does not change. 


Adiabatic Processes 


In an adiabatic process, the system is insulated from its environment so 
that although the state of the system changes, no heat is allowed to enter or 
leave the system, as seen in [link]. An adiabatic process can be conducted 
either quasi-statically or non-quasi-statically. When a system expands 
adiabatically, it must do work against the outside world, and therefore its 
energy goes down, which is reflected in the lowering of the temperature of 
the system. An adiabatic expansion leads to a lowering of temperature, and 
an adiabatic compression leads to an increase of temperature. We discuss 
adiabatic expansion again in Adiabatic Processes for an ideal Gas. 


Insulation 
System 


An insulated piston with a hot, 
compressed gas is released. The 
piston moves up, the volume 
expands, and the pressure and 
temperature decrease. The internal 


energy goes into work. If the 
expansion occurs within a time 
frame in which negligible heat can 
enter the system, then the process is 
called adiabatic. Ideally, during an 
adiabatic process no heat enters or 
exits the system. 


Cyclic Processes 


We say that a system goes through a cyclic process if the state of the system 
at the end is same as the state at the beginning. Therefore, state properties 
such as temperature, pressure, volume, and internal energy of the system do 
not change over a complete cycle: 

Equation: 


A Eine a 0. 


When the first law of thermodynamics is applied to a cyclic process, we 
obtain a simple relation between heat into the system and the work done by 
the system over the cycle: 

Equation: 


Q = W (cyclic process). 


Thermodynamic processes are also distinguished by whether or not they are 
reversible. A reversible process is one that can be made to retrace its path 
by differential changes in the environment. Such a process must therefore 
also be quasi-static. Note, however, that a quasi-static process is not 
necessarily reversible, since there may be dissipative forces involved. For 
example, if friction occurred between the piston and the walls of the 
cylinder containing the gas, the energy lost to friction would prevent us 
from reproducing the original states of the system. 


We considered several thermodynamic processes: 
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. An isothermal process, during which the system’s temperature remains 


constant 


. An adiabatic process, during which no heat is transferred to or from 


the system 


. An isobaric process, during which the system’s pressure does not 


change 


. An isochoric process, during which the system’s volume does not 


change 


Many other processes also occur that do not fit into any of these four 
categories. 


Note: 

View this site to set up your own process in a pV diagram. See if you can 
calculate the values predicted by the simulation for heat, work, and change 
in internal energy. 


Summary 


The thermal behavior of a system is described in terms of 
thermodynamic variables. For an ideal gas, these variables are 
pressure, volume, temperature, and number of molecules or moles of 
the gas. 

For systems in thermodynamic equilibrium, the thermodynamic 
variables are related by an equation of state. 

A heat reservoir is so large that when it exchanges heat with other 
systems, its temperature does not change. 

A quasi-static process takes place so slowly that the system involved is 
always in thermodynamic equilibrium. 

A reversible process is one that can be made to retrace its path and 
both the temperature and pressure are uniform throughout the system. 


e There are several types of thermodynamic processes, including (a) 
isothermal, where the system’s temperature is constant; (b) adiabatic, 
where no heat is exchanged by the system; (c) isobaric, where the 
system’s pressure is constant; and (d) isochoric, where the system’s 
volume is constant. 

e Asa consequence of the first law of thermodymanics, here is a 
summary of the thermodymaic processes: (a) isothermal: 

AE int = 0,Q = W; (b) adiabatic: Q = 0, AFin = —W; (c) 
isobaric: AFing = Q — W; and (d) isochoric: W = 0, AFin = Q. 


Conceptual Questions 


Exercise: 
Problem: 


When a gas expands isothermally, it does work. What is the source of 
energy needed to do this work? 


Solution: 
The system must be in contact with a heat source that allows heat to 
flow into the system. 
Exercise: 
Problem: 
If the pressure and volume of a system are given, is the temperature 
always uniquely determined? 
Exercise: 
Problem: 
It is unlikely that a process can be isothermal unless it is a very slow 


process. Explain why. Is the same true for isobaric and isochoric 
processes? Explain your answer. 


Solution: 


Isothermal processes must be slow to make sure that as heat is 
transferred, the temperature does not change. Even for isobaric and 
isochoric processes, the system must be in thermal equilibrium with 
slow changes of thermodynamic variables. 


Problems 


Exercise: 


Problem: 


Two moles of a monatomic ideal gas at (5 MPa, 5 L) is expanded 
isothermally until the volume is doubled (step 1). Then it is cooled 
isochorically until the pressure is 1 MPa (step 2). The temperature 
drops in this process. The gas is now compressed isothermally until its 
volume is back to 5 L, but its pressure is now 2 MPa (step 3). Finally, 
the gas is heated isochorically to return to the initial state (step 4). (a) 
Draw the four processes in the pV plane. (b) Find the total work done 
by the gas. 


Exercise: 


Problem: 


Consider a transformation from point A to B in a two-step process. 
First, the pressure is lowered from 3 MPa at point A to a pressure of 1 
MPa, while keeping the volume at 2 L by cooling the system. The state 
reached is labeled C. Then the system is heated at a constant pressure 
to reach a volume of 6 L in the state B. (a) Find the amount of work 
done on the ACB path. (b) Find the amount of heat exchanged by the 
system when it goes from A to B on the ACB path. (c) Compare the 
change in the internal energy when the AB process occurs adiabatically 
with the AB change through the two-step process on the ACB path. 


Solution: 


a. 1660 J; b. -2730 J; c. It does not depend on the process. 


Exercise: 


Problem: 


Consider a cylinder with a movable piston containing n moles of an 
ideal gas. The entire apparatus is immersed in a constant temperature 
bath of temperature T kelvin. The piston is then pushed slowly so that 
the pressure of the gas changes quasi-statically from pj, to pp at 
constant temperature T. Find the work done by the gas in terms of n, R, 
A Pi, and p2. 


Exercise: 
Problem: 
An ideal gas expands isothermally along AB and does 700 J of work 
(see below). (a) How much heat does the gas exchange along AB? (b) 
The gas then expands adiabatically along BC and does 400 J of work. 


When the gas returns to A along CA, it exhausts 100 J of heat to its 
surroundings. How much work is done on the gas along this path? 


p 


Solution: 


a. 700 J; b. 500 J 


Exercise: 


Problem: 


Consider the processes shown below for a monatomic gas. (a) Find the 
work done in each of the processes AB, BC, AD, and DC. (b) Find the 
internal energy change in processes AB and BC. (c) Find the internal 
energy difference between states C and A. (d) Find the total heat added 
in the ADC process. (e) From the information given, can you find the 
heat added in process AD? Why or why not? 


p (atm) 
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Exercise: 


Problem: 


Two moles of helium gas are placed in a cylindrical container with a 
piston. The gas is at room temperature 25 °C and under a pressure of 
3.0 x 10° Pa. When the pressure from the outside is decreased while 
keeping the temperature the same as the room temperature, the volume 
of the gas doubles. (a) Find the work the external agent does on the gas 
in the process. (b) Find the heat exchanged by the gas and indicate 
whether the gas takes in or gives up heat. Assume ideal gas behavior. 


Solution: 


a. -3 400 J; b. 3400 J enters the gas 


Exercise: 


Problem: 


An amount of n moles of a monatomic ideal gas in a conducting 
container with a movable piston is placed in a large thermal heat bath 
at temperature 7} and the gas is allowed to come to equilibrium. After 
the equilibrium is reached, the pressure on the piston is lowered so that 
the gas expands at constant temperature. The process is continued 
quasi-statically until the final pressure is 4/3 of the initial pressure pj. 
(a) Find the change in the internal energy of the gas. (b) Find the work 
done by the gas. (c) Find the heat exchanged by the gas, and indicate, 
whether the gas takes in or gives up heat. 


Glossary 


adiabatic process 
process during which no heat is transferred to or from the system 


cyclic process 
process in which the state of the system at the end is same as the state 
at the beginning 


isobaric process 
process during which the system’s pressure does not change 


isochoric process 
process during which the system’s volume does not change 


isothermal process 
process during which the system’s temperature remains constant 


reversible process 
process that can be reverted to restore both the system and its 
environment back to their original states together 


thermodynamic process 


manner in which a state of a system can change from initial state to 
final state 


Heat Capacities of an Ideal Gas 
By the end of this section, you will be able to: 


e Define heat capacity of an ideal gas for a specific process 

e Calculate the specific heat of an ideal gas for either an isobaric or 
isochoric process 

e Explain the difference between the heat capacities of an ideal gas and a 
real gas 

e Estimate the change in specific heat of a gas over temperature ranges 


We learned about specific heat and molar heat capacity in Temperature and 
Heat; however, we have not considered a process in which heat is added. We do 
that in this section. First, we examine a process where the system has a constant 
volume, then contrast it with a system at constant pressure and show how their 
specific heats are related. 


Let’s start with looking at [link], which shows two vessels A and B, each 
containing 1 mol of the same type of ideal gas at a temperature T and a volume 
V. The only difference between the two vessels is that the piston at the top of A 
is fixed, whereas the one at the top of B is free to move against a constant 
external pressure p. We now consider what happens when the temperature of 
the gas in each vessel is slowly increased to T + dT with the addition of heat. 


Vessel A Vessel B 


Two vessels are identical except that the 
piston at the top of A is fixed, whereas that 
atop B is free to move against a constant 
external pressure p. 


Since the piston of vessel A is fixed, the volume of the enclosed gas does not 
change. Consequently, the gas does no work, and we have from the first law 
Equation: 


dE... — dQ — dW = dQ. 


We represent the fact that the heat is exchanged at constant volume by writing 
Equation: 


dQ = CyndT, 


where Cy is the molar heat capacity at constant volume of the gas. In 
addition, since dE;,, = dQ for this particular process, 
Equation: 


dE int = CyndT. 


We obtained this equation assuming the volume of the gas was fixed. However, 
internal energy is a state function that depends on only the temperature of an 
ideal gas. Therefore, dEingt = CyndT gives the change in internal energy of an 
ideal gas for any process involving a temperature change dT. 


When the gas in vessel B is heated, it expands against the movable piston and 
does work dW = pdV. In this case, the heat is added at constant pressure, and 
we write 

Equation: 


dQ = C,ndT, 


where C,, is the molar heat capacity at constant pressure of the gas. 
Furthermore, since the ideal gas expands against a constant pressure, 
Equation: 


d(pV) = d(RnT) 


becomes 
Equation: 


pdV = RndT. 


Finally, inserting the expressions for dQ and pdV into the first law, we obtain 
Equation: 


dEin, = dQ — pdV = (Cyn — Rn)dT. 


We have found d£;,; for both an isochoric and an isobaric process. Because the 
internal energy of an ideal gas depends only on the temperature, dF;,4 must be 
the same for both processes. Thus, 

Equation: 


CyndT = (C,n — Rn)dT, 


and 


Note: 
Equation: 


Cp = Cy +R. 


The derivation of [link] was based only on the ideal gas law. Consequently, this 
relationship is approximately valid for all dilute gases, whether monatomic like 
He, diatomic like Oz, or polyatomic like CO2 or NH3. 


In the preceding chapter, we found the molar heat capacity of an ideal gas under 
constant volume to be 
Equation: 


where d is the number of degrees of freedom of a molecule in the system. [link] 
shows the molar heat capacities of some dilute ideal gases at room temperature. 
The heat capacities of real gases are somewhat higher than those predicted by 
the expressions of Cy and C, given in [link]. This indicates that vibrational 
motion in polyatomic molecules is significant, even at room temperature. 
Nevertheless, the difference in the molar heat capacities, C, — Cy, is very 
close to R, even for the polyatomic gases. 


Molar Heat Capacities of Dilute Ideal Gases at Room Temperature 


Cp — Cv 
Type of Ch Cy (J/mol 
Molecule Gas (J/mol K) (J/mol K) K) 


Monatomic Ideal 2R=20.79 #R=1247 R=8.31 
Diatomic Ideal JSR=29.10 2R=20.79 R=8.31 


Polyatomic Ideal AR = 33.26 3R = 24.94 R= 8.31 


Summary 


¢ For an ideal gas, the molar capacity at constant pressure C;,, is given by 
Cy, = Cy + R = dR/2 + R, where d is the number of degrees of 
freedom of each molecule/entity in the system. 

e A real gas has a specific heat close to but a little bit higher than that of the 
corresponding ideal gas with C, ~ Cy + R. 


Conceptual Questions 


Exercise: 
Problem: 
How can an object transfer heat if the object does not possess a discrete 
quantity of heat? 

Exercise: 
Problem: 
Most materials expand when heated. One notable exception is water 
between 0 °C and 4 °C, which actually decreases in volume with the 


increase in temperature. Which is greater for water in this temperature 
region, C, or Cy ? 


Solution: 


Typically C, is greater than Cy because when expansion occurs under 
constant pressure, it does work on the surroundings. Therefore, heat can go 
into internal energy and work. Under constant volume, all heat goes into 
internal energy. In this example, water contracts upon heating, so if we add 
heat at constant pressure, work is done on the water by surroundings and 
therefore, C’,, is less than Cy. 


Exercise: 


Problem: 


Why are there two specific heats for gases C, and Cy, yet only one given 
for solid? 


Problems 


Exercise: 


Problem: 


The temperature of an ideal monatomic gas rises by 8.0 K. What is the 
change in the internal energy of 1 mol of the gas at constant volume? 


Solution: 


100 J 
Exercise: 
Problem: 
For a temperature increase of 10 °C at constant volume, what is the heat 


absorbed by (a) 3.0 mol of a dilute monatomic gas; (b) 0.50 mol of a dilute 
diatomic gas; and (c) 15 mol of a dilute polyatomic gas? 


Exercise: 


Problem: 


If the gases of the preceding problem are initially at 300 K, what are their 
internal energies after they absorb the heat? 


Solution: 


a. 370 J; b. 100 J; c. 500 J 
Exercise: 


Problem: 


Consider 0.40 mol of dilute carbon dioxide at a pressure of 0.50 atm and a 
volume of 50 L. What is the internal energy of the gas? 


Exercise: 


Problem: 


When 400 J of heat are slowly added to 10 mol of an ideal monatomic gas, 
its temperature rises by 10 “C. What is the work done on the gas? 


Solution: 


850 J 


Exercise: 


Problem: 


One mole of a dilute diatomic gas occupying a volume of 10.00 L expands 
against a constant pressure of 2.000 atm when it is slowly heated. If the 
temperature of the gas rises by 10.00 K and 400.0 J of heat are added in 
the process, what is its final volume? 


Glossary 


molar heat capacity at constant pressure 
quantifies the ratio of the amount of heat added removed to the 
temperature while measuring at constant pressure 


molar heat capacity at constant volume 
quantifies the ratio of the amount of heat added removed to the 
temperature while measuring at constant volume 


Adiabatic Processes for an Ideal Gas 
By the end of this section, you will be able to: 


e Define adiabatic expansion of an ideal gas 
e Demonstrate the qualitative difference between adiabatic and isothermal 
expansions 


When an ideal gas is compressed adiabatically (Q = 0), work is done on it and 
its temperature increases; in an adiabatic expansion, the gas does work and its 
temperature drops. Adiabatic compressions actually occur in the cylinders of a 
car, where the compressions of the gas-air mixture take place so quickly that 
there is no time for the mixture to exchange heat with its environment. 
Nevertheless, because work is done on the mixture during the compression, its 
temperature does rise significantly. In fact, the temperature increases can be so 
large that the mixture can explode without the addition of a spark. Such 
explosions, since they are not timed, make a car run poorly—it usually “knocks.” 
Because ignition temperature rises with the octane of gasoline, one way to 
overcome this problem is to use a higher-octane gasoline. 


Another interesting adiabatic process is the free expansion of a gas. [link] shows 
a gas confined by a membrane to one side of a two-compartment, thermally 
insulated container. When the membrane is punctured, gas rushes into the empty 
side of the container, thereby expanding freely. Because the gas expands “against 
a vacuum” (p = 0), it does no work, and because the vessel is thermally 
insulated, the expansion is adiabatic. With Q = 0 and W = 0 in the first law, 
AEF ing = 0, 80 Ein; = Eine ¢ for the free expansion. 


Initial equilibrium state Final equilibrium state 


Vacuum 


The gas in the left chamber expands freely into the right chamber when the 
membrane is punctured. 


If the gas is ideal, the internal energy depends only on the temperature. 
Therefore, when an ideal gas expands freely, its temperature does not change. 


A quasi-static, adiabatic expansion of an ideal gas is represented in [link], which 
shows an insulated cylinder that contains 1 mol of an ideal gas. The gas is made 
to expand quasi-statically by removing one grain of sand at a time from the top 
of the piston. When the gas expands by dV, the change in its temperature is dT. 
The work done by the gas in the expansion is dW = pdV; dQ = 0 because the 
cylinder is insulated; and the change in the internal energy of the gas is, from 
[link], dEing = CyndT’. Therefore, from the first law, 

Equation: 


CyndT = 0 — pdV = —pdV, 


0) 
Equation: 


Insulation 


When sand is removed from the piston one grain at 
a time, the gas expands adiabatically and quasi- 
statically in the insulated vessel. 


Also, for 1 mol of an ideal gas, 
Equation: 


d(pV) = d(RnT), 


SO 
Equation: 


pdV + Vdp = RndT 


and 

Equation: 

_ pdV + Vdp 
7 Rn 


dT’ 


We now have two equations for dT. Upon equating them, we find that 
Equation: 


CynVdp + (Cyn + Rn)pdV = 0. 


Now, we divide this equation by npV and use C, = Cy + R. We are then left 
with 


Equation: 
dp dV 
Cy = rs ad 0, 
which becomes 
Equation: 
dp, dV _ | 
i veo 


where we define y as the ratio of the molar heat capacities: 


Note: 
Equation: 


Thus, 
Equation: 


and 
Equation: 


Inp+yln V = constant. 


Finally, using n(A*) = alnA and In AB = In A + In B, we can write this in 
the form 


Note: 
Equation: 


pV” =constant. 


This equation is the condition that must be obeyed by an ideal gas in a quasi- 
static adiabatic process. For example, if an ideal gas makes a quasi-static 
adiabatic transition from a state with pressure and volume p and Vj to a state 
with pz and Va, then it must be true that pi V," = poV,". 


The adiabatic condition of [link] can be written in terms of other pairs of 
thermodynamic variables by combining it with the ideal gas law. In doing this, 
we find that 

Equation: 


p ’T’ = constant 


and 
Equation: 


TV 1-! = constant. 


A reversible adiabatic expansion of an ideal gas is represented on the pV diagram 
of [link]. The slope of the curve at any point is 
Equation: 


dp de constant \ sp 
dv dv 7 


_ = 


baal 
“- 
=e, 


Adiabatic 


Quasi-static adiabatic and isothermal 
expansions of an ideal gas. 


The dashed curve shown on this pV diagram represents an isothermal expansion 
where T (and therefore pV) is constant. The slope of this curve is useful when we 
consider the second law of thermodynamics in the next chapter. This slope is 
Equation: 


dp d nRT_ p 


dV dV Vo VW’ 


Because y > 1, the isothermal curve is not as steep as that for the adiabatic 
expansion. 


Example: 

Compression of an Ideal Gas in an Automobile Engine 

Gasoline vapor is injected into the cylinder of an automobile engine when the 
piston is in its expanded position. The temperature, pressure, and volume of the 
resulting gas-air mixture are 20 °C, 1.00 x 10° N/m’, and 240 cm, 
respectively. The mixture is then compressed adiabatically to a volume of 

40 cm®. Note that in the actual operation of an automobile engine, the 
compression is not quasi-static, although we are making that assumption here. 
(a) What are the pressure and temperature of the mixture after the compression? 
(b) How much work is done by the mixture during the compression? 

Strategy 

Because we are modeling the process as a quasi-static adiabatic compression of 


an ideal gas, we have pV? = constant and pV = nRT. The work needed can 
2 


then be evaluated with W = pdV. 
Vi 
Solution 


a. For an adiabatic compression we have 
Equation: 


so after the compression, the pressure of the mixture is 
Equation: 


240 x 10°-° m? 


= (1.00 x 10° N/m?) ( 
Pa = ( poe 40 x 10-8 m3 


1.40 
) = 1.23 x 10°N/m?. 


From the ideal gas law, the temperature of the mixture after the 
compression is 
Equation: 


T, = (24)T 
~~ (1.00 x 10° N/m?)(240 x 10-6 m3) aa 
= (00K = 328 C. 


b. The work done by the mixture during the compression is 
Equation: 
V2 


ii || ane 
Vi 


With the adiabatic condition of [link], we may write p as K/VY, where 
K = p,V," = peV,". The work is therefore 


Equation: 
5 
ie 
W = —dV 
vy, VO 
= ( 1 1 ) 
1—-y \ Var Ver 
= (3 mens ) 
1—-y \ Var Vi 
~ Ty (p2Ve = 4) 
= 1 al(1.23 x 10°N/m*)(40 x 10-° m) 
—(1.00 x 10°N/m?)(240 x 10~°m)] 
= —63 J. 
Significance 


The negative sign on the work done indicates that the piston does work on the 
gas-air mixture. The engine would not work if the gas-air mixture did work on 


the piston. 


Summary 


e A quasi-static adiabatic expansion of an ideal gas produces a steeper pV 
curve than that of the corresponding isotherm. 
e A realistic expansion can be adiabatic but rarely quasi-static. 


Key Equations 
Equation of state for a closed f(p,V,T) =0 
system Peri 
ee ; V2 
Net work for a finite change in _ 
volume W= i ' pdV 


Internal energy of a system (average Ent = S(K i+ U;), 


total energy) i 

Internal energy of a monatomic Pan ( - kpT) = = nRT 
ideal gas 

First law of thermodynamics AEFin, = Q —W 

Molar heat capacity at constant CoC R 

pressure 

Ratio of molar heat capacities y = C,/Cy 


Condition for an ideal gas ina 


pte pV” = constant 
quasi-static adiabatic process 


Conceptual Questions 


Exercise: 


Problem: Is it possible for y to be smaller than unity? 


Solution: 


No, it is always greater than 1. 


Exercise: 


Problem: Would you expect y to be larger for a gas or a solid? Explain. 
Exercise: 

Problem: 

There is no change in the internal energy of an ideal gas undergoing an 

isothermal process since the internal energy depends only on the 


temperature. Is it therefore correct to say that an isothermal process is the 
same as an adiabatic process for an ideal gas? Explain your answer. 


Solution: 


An adiabatic process has a change in temperature but no heat flow. The 
isothermal process has no change in temperature but has heat flow. 


Exercise: 


Problem: 
Does a gas do any work when it expands adiabatically? If so, what is the 
source of the energy needed to do this work? 

Problems 


Exercise: 


Problem: 


A monatomic ideal gas undergoes a quasi-static adiabatic expansion in 
which its volume is doubled. How is the pressure of the gas changed? 


Solution: 


pressure decreased by 0.31 times the original pressure 


Exercise: 
Problem: 
An ideal gas has a pressure of 0.50 atm and a volume of 10 L. It is 


compressed adiabatically and quasi-statically until its pressure is 3.0 atm 
and its volume is 2.8 L. Is the gas monatomic, diatomic, or polyatomic? 


Exercise: 
Problem: 
Pressure and volume measurements of a dilute gas undergoing a quasi-static 


adiabatic expansion are shown below. Plot In p vs. V and determine y for 
this gas from your graph. 


P (atm) V (L) 
20.0 1.0 
17.0 1.1 
14.0 1.3 
11.0 1.5 
8.0 2.0 
5.0 2.6 
2.0 a2 
1.0 8.4 


Solution: 


10.0 y = 8.4372x-0-718 


a=) 113 
Exercise: 


Problem: 


An ideal monatomic gas at 300 K expands adiabatically and reversibly to 


twice its volume. What is its final temperature? 
Exercise: 


Problem: 


An ideal diatomic gas at 80 K is slowly compressed adiabatically and 


reversibly to half its volume. What is its final temperature? 


Solution: 


84 K 
Exercise: 


Problem: 


An ideal diatomic gas at 80 K is slowly compressed adiabatically to one- 


third its original volume. What is its final temperature? 


Exercise: 
Problem: 
Compare the charge in internal energy of an ideal gas for a quasi-static 


adiabatic expansion with that for a quasi-static isothermal expansion. What 
happens to the temperature of an ideal gas in an adiabatic expansion? 


Solution: 


An adiabatic expansion has less work done and no heat flow, thereby a 
lower internal energy comparing to an isothermal expansion which has both 
heat flow and work done. Temperature decreases during adiabatic 
expansion. 


Exercise: 
Problem: 
The temperature of n moles of an ideal gas changes from 7) to T3 ina 


quasi-static adiabatic transition. Show that the work done by the gas is 
given by 


W= (Ti —T2). 


Exercise: 


Problem: 
A dilute gas expands quasi-statically to three times its initial volume. Is the 
final gas pressure greater for an isothermal or an adiabatic expansion? Does 


your answer depend on whether the gas is monatomic, diatomic, or 
polyatomic? 


Solution: 


Isothermal has a greater final pressure and does not depend on the type of 
gas. 


Exercise: 


Problem: 


(a) An ideal gas expands adiabatically from a volume of 2.0 x 10°-° m? to 
2.5 x 10° m?. If the initial pressure and temperature were 5.0 x 10° Pa 
and 300 K, respectively, what are the final pressure and temperature of the 
gas? Use y = 5/3 for the gas. (b) In an isothermal process, an ideal gas 
expands from a volume of 2.0 x 10°-2m*to2.5 x 10°? m®. If the initial 
pressure and temperature were 5.0 x 10° Pa and 300 K, respectively, what 
are the final pressure and temperature of the gas? 


Exercise: 


Problem: 


On an adiabatic process of an ideal gas pressure, volume and temperature 
change such that pV” is constant with -~ = 5/3 for monatomic gas such as 
helium and -y = 7/5 for diatomic gas such as hydrogen at room 
temperature. Use numerical values to plot two isotherms of 1 mol of helium 
gas using ideal gas law and two adiabatic processes mediating between 
them. Use T; = 500K, V; = 1L, and TJ) = 300 K for your plot. 


Solution: 


Adiabatic 


1.0 1.2 1.4 1.6 1.8 2.0 V(L) 


Exercise: 


Problem: 


Two moles of a monatomic ideal gas such as helium is compressed 
adiabatically and reversibly from a state (3 atm, 5 L) to a state with pressure 
4 atm. (a) Find the volume and temperature of the final state. (b) Find the 
temperature of the initial state of the gas. (c) Find the work done by the gas 
in the process. (d) Find the change in internal energy of the gas in the 
process. 


Additional Problems 


Exercise: 


Problem: 


Consider the process shown below. During steps AB and BC, 3600 J and 
2400 J of heat, respectively, are added to the system. (a) Find the work done 
in each of the processes AB, BC, AD, and DC. (b) Find the internal energy 
change in processes AB and BC. (c) Find the internal energy difference 
between states C and A. (d) Find the total heat added in the ADC process. 
(e) From the information given, can you find the heat added in process AD? 
Why or why not? 


p (atm) 
5 


Solution: 


a. Wap = 0, Waco = 2026 J,Wap = 810.43, Wc = 0; b. 

AE ap = 3600 J, AE go = 3743; c. AE 4c = 3974 J; d. 

Q apc = 4784 J; e. No, because heat was added for both parts AD and DC. 
There is not enough information to figure out how much is from each 
segment of the path. 


Exercise: 
Problem: 
A car tire contains 0.0380 m? of air at a pressure of 2.20 x 10° Pa (about 
32 psi). How much more internal energy does this gas have than the same 


volume has at zero gauge pressure (which is equivalent to normal 
atmospheric pressure)? 


Exercise: 
Problem: 
A helium-filled toy balloon has a gauge pressure of 0.200 atm and a volume 


of 10.0 L. How much greater is the internal energy of the helium in the 
balloon than it would be at zero gauge pressure? 


Solution: 


300 J 
Exercise: 


Problem: 


Steam to drive an old-fashioned steam locomotive is supplied at a constant 
gauge pressure of 1.75 x 10°N/ m? (about 250 psi) to a piston with a 
0.200-m radius. (a) By calculating pAV, find the work done by the steam 
when the piston moves 0.800 m. Note that this is the net work output, since 
gauge pressure is used. (b) Now find the amount of work by calculating the 
force exerted times the distance traveled. Is the answer the same as in part 


(a)? 


Exercise: 


Problem: 


A hand-driven tire pump has a piston with a 2.50-cm diameter and a 
maximum stroke of 30.0 cm. (a) How much work do you do in one stroke if 
the average gauge pressure is 2.4 x 10°N/ m” (about 35 psi)? (b) What 
average force do you exert on the piston, neglecting friction and 
gravitational force? 


Solution: 


a: 59.5 J: b. 170.N 
Exercise: 


Problem: 


Calculate the net work output of a heat engine following path ABCDA as 
shown below. 


p (10° Nim?) 


0 1.0 49 V (10-3 m3) 


Exercise: 
Problem: 
What is the net work output of a heat engine that follows path ABDA in the 


preceding problem with a straight line from B to D? Why is the work output 
less than for path ABCDA? 


Solution: 


2A 50? J 
Exercise: 
Problem: 
Five moles of a monatomic ideal gas in a cylinder at 27 °C is expanded 
isothermally from a volume of 5 L to 10 L. (a) What is the change in 


internal energy? (b) How much work was done on the gas in the process? 
(c) How much heat was transferred to the gas? 


Exercise: 
Problem: 
Four moles of a monatomic ideal gas in a cylinder at 27 “C is expanded at 
constant pressure equal to 1 atm until its volume doubles. (a) What is the 


change in internal energy? (b) How much work was done by the gas in the 
process? (c) How much heat was transferred to the gas? 


Solution: 


a. 15,000 J; b. 10,000 J; c. 25,000 J 
Exercise: 
Problem: 
Helium gas is cooled from 20 °C to 10 °C by expanding from 40 atm to 1 


atm. If there is 1.4 mol of helium, (a) What is the final volume of helium? 
(b) What is the change in internal energy? 


Exercise: 


Problem: 


In an adiabatic process, oxygen gas in a container is compressed along a 
path that can be described by the following pressure in atm as a function of 
volume V, with Vo = 1L: p = (3.0 atm)(V/Vo) 1. The initial and final 
volumes during the process were 2 L and 1.5 L, respectively. Find the 
amount of work done on the gas. 


Solution: 


78 J 
Exercise: 


Problem: 


A cylinder containing three moles of a monatomic ideal gas is heated at a 
constant pressure of 2 atm. The temperature of the gas changes from 300 K 
to 350 K as a result of the expansion. Find work done (a) on the gas; and (b) 
by the gas. 


Exercise: 


Problem: 


A cylinder containing three moles of nitrogen gas is heated at a constant 
pressure of 2 atm. The temperature of the gas changes from 300 K to 350 K 
as a result of the expansion. Find work done (a) on the gas, and (b) by the 
gas by using van der Waals equation of state instead of ideal gas law. 


Solution: 


A cylinder containing three moles of nitrogen gas is heated at a constant 
pressure of 2 atm. a. -1220 J; b. +1220 J 


Exercise: 


Problem: 


Two moles of a monatomic ideal gas such as helium is compressed 
adiabatically and reversibly from a state (3 atm, 5 L) to a state with a 
pressure of 4 atm. (a) Find the volume and temperature of the final state. (b) 
Find the temperature of the initial state. (c) Find work done by the gas in the 
process. (d) Find the change in internal energy in the process. Assume 

Cy = 5Rand C, = Cy + R for the diatomic ideal gas in the conditions 
given. 


Exercise: 


Problem: 


An insulated vessel contains 1.5 moles of argon at 2 atm. The gas initially 
occupies a volume of 5 L. As a result of the adiabatic expansion the 
pressure of the gas is reduced to 1 atm. (a) Find the volume and temperature 
of the final state. (b) Find the temperature of the gas in the initial state. (c) 
Find the work done by the gas in the process. (d) Find the change in the 
internal energy of the gas in the process. 


Solution: 


a. 7.6 L, 61.6 K; b. 81.3 K; c. 3.63 L - atm = 367 J; d. -367 J 


Challenge Problems 


Exercise: 


Problem: 

One mole of an ideal monatomic gas occupies a volume of 1.0 x 10-7 m? 
at a pressure of 2.0 x 10°N / m’. (a) What is the temperature of the gas? 
(b) The gas undergoes a quasi-static adiabatic compression until its volume 
is decreased to 5.0 x 10° m®. What is the new gas temperature? (c) How 
much work is done on the gas during the compression? (d) What is the 
change in the internal energy of the gas? 


Exercise: 


Problem: 


One mole of an ideal gas is initially in a chamber of volume 

1.0 x 10°? m? and at a temperature of 27 °C. (a) How much heat is 
absorbed by the gas when it slowly expands isothermally to twice its initial 
volume? (b) Suppose the gas is slowly transformed to the same final state 
by first decreasing the pressure at constant volume and then expanding it 
isobarically. What is the heat transferred for this case? (c) Calculate the heat 
transferred when the gas is transformed quasi-statically to the same final 
state by expanding it isobarically, then decreasing its pressure at constant 
volume. 


Solution: 


a. 1700 J; b. 1200 J; c. 2400 J 
Exercise: 


Problem: 


A bullet of mass 10 g is traveling horizontally at 200 m/s when it strikes and 
embeds in a pendulum bob of mass 2.0 kg. (a) How much mechanical 
energy is dissipated in the collision? (b) Assuming that C’, for the bob plus 
bullet is 3R, calculate the temperature increase of the system due to the 
collision. Take the molecular mass of the system to be 200 g/mol. 


Exercise: 


Problem: 


The insulated cylinder shown below is closed at both ends and contains an 
insulating piston that is free to move on frictionless bearings. The piston 
divides the chamber into two compartments containing gases A and B. 
Originally, each compartment has a volume of 5.0 x 107-7 m° and 
contains a monatomic ideal gas at a temperature of 0 °C and a pressure of 
1.0 atm. (a) How many moles of gas are in each compartment? (b) Heat Q is 
slowly added to A so that it expands and B is compressed until the pressure 
of both gases is 3.0 atm. Use the fact that the compression of B is adiabatic 
to determine the final volume of both gases. (c) What are their final 
temperatures? (d) What is the value of Q? 


Solution: 


a. 2.2 mol; b. V4 = 2.6 x 10°-2m?, Vp = 7.4 x 1072m?:c. 
T4 = 1220K, Tg = 430 K; d. 30,500 J 


Exercise: 


Problem: 


In a diesel engine, the fuel is ignited without a spark plug. Instead, air in a 
cylinder is compressed adiabatically to a temperature above the ignition 
temperature of the fuel; at the point of maximum compression, the fuel is 
injected into the cylinder. Suppose that air at 20 °C is taken into the 
cylinder at a volume Vj, and then compressed adiabatically and quasi- 
statically to a temperature of 600 “C and a volume V3. If ~ = 1.4, what is 
the ratio V; /V2? (Note: In an operating diesel engine, the compression is 
not quasi-static.) 


Introduction 
class="introduction' 


A xenon ion 
engine from 
the Jet 
Propulsion 
Laboratory 
shows the 
faint blue 
glow of 
charged 
atoms 
emitted 
from the 
engine. The 
ion 
propulsion 
engine is the 
first 
nonchemica 
| propulsion 
to be used 
as the 
primary 
means of 
propelling a 
spacecraft. 
(credit: 
modificatio 
n of work 
by 
NASA/JPL) 


According to the first law of thermodynamics, the only processes that can 
occur are those that conserve energy. But this cannot be the only restriction 
imposed by nature, because many seemingly possible thermodynamic 
processes that would conserve energy do not occur. For example, when two 
bodies are in thermal contact, heat never flows from the colder body to the 
warmer one, even though this is not forbidden by the first law. So some 
other thermodynamic principles must be controlling the behavior of 
physical systems. 


One such principle is the second law of thermodynamics, which limits the 
use of energy within a source. Energy cannot arbitrarily pass from one 
object to another, just as we cannot transfer heat from a cold object to a hot 
one without doing any work. We cannot unmix cream from coffee without a 
chemical process that changes the physical characteristics of the system or 
its environment. We cannot use internal energy stored in the air to propel a 
car, or use the energy of the ocean to run a ship, without disturbing 
something around that object. 


In the chapter covering the first law of thermodynamics, we started our 
discussion with a joke by C. P. Snow stating that the first law means “you 
can’t win.” He paraphrased the second law as “you can’t break even, except 
on a very cold day.” Unless you are at zero kelvin, you cannot convert 
100% of thermal energy into work. We start by discussing spontaneous 


processes and explain why some processes require work to occur even if 
energy would have been conserved. 


Reversible and Irreversible Processes 
By the end of this section, you will be able to: 


e Define reversible and irreversible processes 
e State the second law of thermodynamics via an irreversible process 


Consider an ideal gas that is held in half of a thermally insulated container 
by a wall in the middle of the container. The other half of the container is 
under vacuum with no molecules inside. Now, if we remove the wall in the 
middle quickly, the gas expands and fills up the entire container 
immediately, as shown in [Link]. 


Vacuum 


6 
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Gas fills the container 


(b) 


A gas expanding from half of a container to the entire container (a) 
before and (b) after the wall in the middle is removed. 


Because half of the container is under vacuum before the gas expands there, 
we do not expect any work to be done by the system—that is, W = 0— 
because no force from the vacuum is exerted on the gas during the 
expansion. If the container is thermally insulated from the rest of the 
environment, we do not expect any heat transfer to the system either, so 

Q = 0. Then the first law of thermodynamics leads to the change of the 
internal energy of the system, 

Equation: 


AEFin = Q—W =0. 


For an ideal gas, if the internal energy doesn’t change, then the temperature 
stays the same. Thus, the equation of state of the ideal gas gives us the final 
pressure of the gas, p = nRT'/V = po/2, where pg is the pressure of the 
gas before the expansion. The volume is doubled and the pressure is halved, 
but nothing else seems to have changed during the expansion. 


All of this discussion is based on what we have learned so far and makes 
sense. Here is what puzzles us: Can all the molecules go backward to the 
original half of the container in some future time? Our intuition tells us that 
this is going to be very unlikely, even though nothing we have learned so 
far prevents such an event from happening, regardless of how small the 
probability is. What we are really asking is whether the expansion into the 
vacuum half of the container is reversible. 


A reversible process is a process in which the system and environment can 
be restored to exactly the same initial states that they were in before the 
process occurred, if we go backward along the path of the process. The 
necessary condition for a reversible process is therefore the quasi-static 
requirement. Note that it is quite easy to restore a system to its original 
state; the hard part is to have its environment restored to its original state at 
the same time. For example, in the example of an ideal gas expanding into 
vacuum to twice its original volume, we can easily push it back with a 
piston and restore its temperature and pressure by removing some heat from 
the gas. The problem is that we cannot do it without changing something in 
its surroundings, such as dumping some heat there. 


A reversible process is truly an ideal process that rarely happens. We can 
make certain processes close to reversible and therefore use the 
consequences of the corresponding reversible processes as a starting point 
or reference. In reality, almost all processes are irreversible, and some 
properties of the environment are altered when the properties of the system 
are restored. The expansion of an ideal gas, as we have just outlined, is 
irreversible because the process is not even quasi-static, that is, not in an 
equilibrium state at any moment of the expansion. 


From the microscopic point of view, a particle described by Newton’s 
second law can go backward if we flip the direction of time. But this is not 


the case, in practical terms, in a macroscopic system with more than 1078 
particles or molecules, where numerous collisions between these molecules 
tend to erase any trace of memory of the initial trajectory of each of the 
particles. For example, we can actually estimate the chance for all the 
particles in the expanded gas to go back to the original half of the container, 
but the current age of the universe is still not long enough for it to happen 
even once. 


An irreversible process is what we encounter in reality almost all the time. 
The system and its environment cannot be restored to their original states at 
the same time. Because this is what happens in nature, it is also called a 
natural process. The sign of an irreversible process comes from the finite 
gradient between the states occurring in the actual process. For example, 
when heat flows from one object to another, there is a finite temperature 
difference (gradient) between the two objects. More importantly, at any 
given moment of the process, the system most likely is not at equilibrium or 
in a well-defined state. This phenomenon is called irreversibility. 


Let us see another example of irreversibility in thermal processes. Consider 
two objects in thermal contact: one at temperature 7} and the other at 
temperature 7, > 7}, as shown in [link]. 


Spontaneous heat flow from an object at 
higher temperature T> to another at lower 
temperature T}. 


We know from common personal experience that heat flows from a hotter 
object to a colder one. For example, when we hold a few pieces of ice in 


our hands, we feel cold because heat has left our hands into the ice. The 
opposite is true when we hold one end of a metal rod while keeping the 
other end over a fire. Based on all of the experiments that have been done 
on spontaneous heat transfer, the following statement summarizes the 
governing principle: 


Note: 
Second Law of Thermodynamics (Clausius statement) 
Heat never flows spontaneously from a colder object to a hotter object. 


This statement turns out to be one of several different ways of stating the 
second law of thermodynamics. The form of this statement is credited to 
German physicist Rudolf Clausius (1822-1888) and is referred to as the 
Clausius statement of the second law of thermodynamics. The word 
“spontaneously” here means no other effort has been made by a third party, 
or one that is neither the hotter nor colder object. We will introduce some 
other major statements of the second law and show that they imply each 
other. In fact, all the different statements of the second law of 
thermodynamics can be shown to be equivalent, and all lead to the 
irreversibility of spontaneous heat flow between macroscopic objects of a 
very large number of molecules or particles. 


Both isothermal and adiabatic processes sketched on a pV graph (discussed 
in The First Law of Thermodynamics) are reversible in principle because 
the system is always at an equilibrium state at any point of the processes 
and can go forward or backward along the given curves. Other idealized 
processes can be represented by pV curves; [link] summarizes the most 
common reversible processes. 


Process Constant Quantity and Resulting Fact 


Isobaric Constant pressure W = pAV 
Isochoric Constant volume W = 0 
Isothermal Constant temperature AT = 0 
Adiabatic No heat transfer Q = 0 


Summary of Simple Thermodynamic Processes 


Summary 


e A reversible process is one in which both the system and its 
environment can return to exactly the states they were in by following 
the reverse path. 

e An irreversible process is one in which the system and its environment 
cannot return together to exactly the states that they were in. 

e The irreversibility of any natural process results from the second law 
of thermodynamics. 


Conceptual Questions 


Exercise: 


Problem: 


State an example of a process that occurs in nature that is as close to 
reversible as it can be. 


Solution: 
Some possible solutions are frictionless movement; restrained 


compression or expansion; energy transfer as heat due to infinitesimal 
temperature nonuniformity; electric current flow through a zero 


resistance; restrained chemical reaction; and mixing of two samples of 
the same substance at the same state. 


Problems 


Exercise: 
Problem: 
A tank contains 111.0 g chlorine gas (Cl), which is at temperature 
82.0 °C and absolute pressure 5.70 x 10° Pa. The temperature of the 
air outside the tank is 20.0 “C. The molar mass of Cl is 70.9 g/mol. 
(a) What is the volume of the tank? (b) What is the internal energy of 
the gas? (c) What is the work done by the gas if the temperature and 


pressure inside the tank drop to 31.0 °C and 3.80 x 10° Pa, 
respectively, due to a leak? 


Exercise: 
Problem: 
A mole of ideal monatomic gas at 0 “C and 1.00 atm is warmed up to 


expand isobarically to triple its volume. How much heat is transferred 
during the process? 


Solution: 


110010? J 
Exercise: 
Problem: 
A mole of an ideal gas at pressure 4.00 atm and temperature 298 K 


expands isothermally to double its volume. What is the work done by 
the gas? 


Exercise: 


Problem: 


After a free expansion to quadruple its volume, a mole of ideal 
diatomic gas is compressed back to its original volume adiabatically 
and then cooled down to its original temperature. What is the 
minimum heat removed from the gas in the final step to restoring its 
state? 


Solution: 


4.5 pVo 


Glossary 


Clausius statement of the second law of thermodynamics 
heat never flows spontaneously from a colder object to a hotter object 


irreversibility 
phenomenon associated with a natural process 


irreversible process 
process in which neither the system nor its environment can be 
restored to their original states at the same time 


reversible process 
process in which both the system and the external environment 
theoretically can be returned to their original states 


Heat Engines 
By the end of this section, you will be able to: 


e Describe the function and components of a heat engine 
e Explain the efficiency of an engine 
e Calculate the efficiency of an engine for a given cycle of an ideal gas 


A heat engine is a device used to extract heat from a source and then 
convert it into mechanical work that is used for all sorts of applications. For 
example, a steam engine on an old-style train can produce the work needed 
for driving the train. Several questions emerge from the construction and 
application of heat engines. For example, what is the maximum percentage 
of the heat extracted that can be used to do work? This turns out to be a 
question that can only be answered through the second law of 
thermodynamics. 


The second law of thermodynamics can be formally stated in several ways. 
One statement presented so far is about the direction of spontaneous heat 
flow, known as the Clausius statement. A couple of other statements are 
based on heat engines. Whenever we consider heat engines and associated 
devices such as refrigerators and heat pumps, we do not use the normal 
sign convention for heat and work. For convenience, we assume that the 
symbols @, Q-, and W represent only the amounts of heat transferred and 
work delivered, regardless what the givers or receivers are. Whether heat is 
entering or leaving a system and work is done to or by a system are 
indicated by proper signs in front of the symbols and by the directions of 
arrows in diagrams. 


It turns out that we need more than one heat source/sink to construct a heat 
engine. We will come back to this point later in the chapter, when we 
compare different statements of the second law of thermodynamics. For the 
moment, we assume that a heat engine is constructed between a heat source 
(high-temperature reservoir or hot reservoir) and a heat sink (low- 
temperature reservoir or cold reservoir), represented schematically in [link]. 
The engine absorbs heat @,, from a heat source (hot reservoir) of Kelvin 
temperature 7},, uses some of that energy to produce useful work W, and 
then discards the remaining energy as heat Q, into a heat sink (cold 
reservoir) of Kelvin temperature 7(. Power plants and internal combustion 


engines are examples of heat engines. Power plants use steam produced at 
high temperature to drive electric generators, while exhausting heat to the 
atmosphere or a nearby body of water in the role of the heat sink. In an 
internal combustion engine, a hot gas-air mixture is used to push a piston, 
and heat is exhausted to the nearby atmosphere in a similar manner. 

Th 


Qh 


Engine 


A schematic representation of a 
heat engine. Energy flows from 
the hot reservoir to the cold 
reservoir while doing work. 


Actual heat engines have many different designs. Examples include internal 
combustion engines, such as those used in most cars today, and external 
combustion engines, such as the steam engines used in old steam-engine 
trains. [link] shows a photo of a nuclear power plant in operation. The 
atmosphere around the reactors acts as the cold reservoir, and the heat 
generated from the nuclear reaction provides the heat from the hot reservoir. 


The heat exhausted from a nuclear power plant goes 
to the cooling towers, where it is released into the 
atmosphere. 


Heat engines operate by carrying a working substance through a cycle. Ina 
steam power plant, the working substance is water, which starts as a liquid, 
becomes vaporized, is then used to drive a turbine, and is finally condensed 
back into the liquid state. As is the case for all working substances in cyclic 
processes, once the water returns to its initial state, it repeats the same 
sequence. 


For now, we assume that the cycles of heat engines are reversible, so there 
is no energy loss to friction or other irreversible effects. Suppose that the 
engine of [link] goes through one complete cycle and that Qy,, Q., and W 
represent the heats exchanged and the work done for that cycle. Since the 
initial and final states of the system are the same, A Fint = O for the cycle. 
We therefore have from the first law of thermodynamics, 

Equation: 


W = Q —- AE nt = (Qn — Q-) — 0, 


so that 


Note: 
Equation: 


W = Qh - Qe. 


The most important measure of a heat engine is its efficiency (e), which is 
simply “what we get out” divided by “what we put in” during each cycle, as 
defined by e = Wout /Qin- 


With a heat engine working between two heat reservoirs, we get out W and 
put in Qy, so the efficiency of the engine is 


Note: 
Equation: 


Here, we used [link], W = Qn — Q,, in the final step of this expression for 
the efficiency. 


Example: 

A Lawn Mower 

A lawn mower is rated to have an efficiency of 25.0% and an average 
power of 3.00 kW. What are (a) the average work and (b) the minimum 
heat discharge into the air by the lawn mower in one minute of use? 
Strategy 

From the average power—that is, the rate of work production—we can 
figure out the work done in the given elapsed time. Then, from the 


efficiency given, we can figure out the minimum heat discharge 
Q. = Qn(1 — e) with Qn = Q. + W. 
Solution 


a. The average work delivered by the lawn mower is 
Equation: 


W = PAt = 3.00 x 10° x 60 x 1.00J = 180kJ. 


b. The minimum heat discharged into the air is given by 
Equation: 


Q. = Q,(1 = e) = (Q. a w)(1 = e) 


which leads to 
Equation: 


Q, = W(1/e — 1) = 180 x (1/0.25 — 1) kJ = 540kJ. 


Significance 
As the efficiency rises, the minimum heat discharged falls. This helps our 
environment and atmosphere by not having as much waste heat expelled. 


Summary 


e The work done by a heat engine is the difference between the heat 
absorbed from the hot reservoir and the heat discharged to the cold 
reservoir, that is, W = Qn — Qa. 

e The ratio of the work done by the engine and the heat absorbed from 
the hot reservoir provides the efficiency of the engine, that is, 


e=W/Qn=1-Q-/Qn. 


Conceptual Questions 


Exercise: 


Problem: 


Explain in practical terms why efficiency is defined as W/Qy. 


Problems 


Exercise: 
Problem: 
An engine is found to have an efficiency of 0.40. If it does 200 J of 


work per cycle, what are the corresponding quantities of heat absorbed 
and discharged? 


Exercise: 
Problem: 


In performing 100.0 J of work, an engine discharges 50.0 J of heat. 
What is the efficiency of the engine? 


Solution: 


0.667 
Exercise: 
Problem: 
An engine with an efficiency of 0.30 absorbs 500 J of heat per cycle. 


(a) How much work does it perform per cycle? (b) How much heat 
does it discharge per cycle? 


Exercise: 
Problem: 
It is found that an engine discharges 100.0 J while absorbing 125.0 J 


each cycle of operation. (a) What is the efficiency of the engine? (b) 
How much work does it perform per cycle? 


Solution: 


av 0:200: b..25,) 
Exercise: 
Problem: 
The temperature of the cold reservoir of the engine is 300 K. It has an 
efficiency of 0.30 and absorbs 500 J of heat per cycle. (a) How much 


work does it perform per cycle? (b) How much heat does it discharge 
per cycle? 


Exercise: 
Problem: 
An engine absorbs three times as much heat as it discharges. The work 
done by the engine per cycle is 50 J. Calculate (a) the efficiency of the 


engine, (b) the heat absorbed per cycle, and (c) the heat discharged per 
cycle. 


Solution: 


a. 0.67; b. 75 J; c. 25 J 
Exercise: 


Problem: 

A coal power plant consumes 100,000 kg of coal per hour and 
produces 500 MW of power. If the heat of combustion of coal is 30 
MJ/kg, what is the efficiency of the power plant? 


Glossary 


cold reservoir 
sink of heat used by a heat engine 


efficiency (e) 
output work from the engine over the input heat to the engine from the 
hot reservoir 


heat engine 
device that converts heat into work 


hot reservoir 
source of heat used by a heat engine 


Refrigerators and Heat Pumps 
By the end of this section, you will be able to: 


e Describe a refrigerator and a heat pump and list their differences 
¢ Calculate the performance coefficients of simple refrigerators and heat 
pumps 


The cycles we used to describe the engine in the preceding section are all 
reversible, so each sequence of steps can just as easily be performed in the 
opposite direction. In this case, the engine is known as a refrigerator or a 
heat pump, depending on what is the focus: the heat removed from the cold 
reservoir or the heat dumped to the hot reservoir. Either a refrigerator or a 
heat pump is an engine running in reverse. For a refrigerator, the focus is 
on removing heat from a specific area. For a heat pump, the focus is on 
dumping heat to a specific area. 


We first consider a refrigerator ([link]). The purpose of this engine is to 
remove heat from the cold reservoir, which is the space inside the 
refrigerator for an actual household refrigerator or the space inside a 
building for an air-conditioning unit. 

Th 


Refrigerator Qh 
or heat pump 


A schematic representation of a 
refrigerator (or a heat pump). The 
arrow next to work (W) indicates work 
being put into the system. 


A refrigerator (or heat pump) absorbs heat @, from the cold reservoir at 
Kelvin temperature 7, and discards heat @y to the hot reservoir at Kelvin 
temperature 7},, while work W is done on the engine’s working substance, 
as shown by the arrow pointing toward the system in the figure. A 
household refrigerator removes heat from the food within it while 
exhausting heat to the surrounding air. The required work, for which we pay 
in our electricity bill, is performed by the motor that moves a coolant 
through the coils. A schematic sketch of a household refrigerator is given in 
[link]. 


Cool fluid 


Expansion 
Evaporator valve 


Warm fluid 


Low pressure Radiator fins 


Condenser 


High pressure 
vapor 


Compressor pump 


A schematic diagram of a household refrigerator. A coolant with a 
boiling temperature below the freezing point of water is sent through 
the cycle (clockwise in this diagram). The coolant extracts heat from 

the refrigerator at the evaporator, causing coolant to vaporize. It is then 


compressed and sent through the condenser, where it exhausts heat to 
the outside. 


The effectiveness or coefficient of performance KR of a refrigerator is 
measured by the heat removed from the cold reservoir divided by the work 
done by the working substance cycle by cycle: 


Note: 
Equation: 


Note that we have used the condition of energy conservation, 
W = Qh — Q,, in the final step of this expression. 


The effectiveness or coefficient of performance Kp of a heat pump is 
measured by the heat dumped to the hot reservoir divided by the work done 
to the engine on the working substance cycle by cycle: 


Note: 
Equation: 


Once again, we use the energy conservation condition W = @, — Q, to 
obtain the final step of this expression. 


Summary 


e A refrigerator or a heat pump is a heat engine run in reverse. 

e The focus of a refrigerator is on removing heat from the cold reservoir 
with a coefficient of performance Kr. 

e The focus of a heat pump is on dumping heat to the hot reservoir with 
a coefficient of performance Kp. 


Conceptual Questions 


Exercise: 


Problem: 


If the refrigerator door is left open, what happens to the temperature of 
the kitchen? 


Solution: 


The temperature increases since the heat output behind the refrigerator 

is greater than the cooling from the inside of the refrigerator. 
Exercise: 

Problem: 

Is it possible for the efficiency of a reversible engine to be greater than 


1.0? Is it possible for the coefficient of performance of a reversible 
refrigerator to be less than 1.0? 


Problems 


Exercise: 


Problem: 


A refrigerator has a coefficient of performance of 3.0. (a) If it requires 
200 J of work per cycle, how much heat per cycle does it remove the 
cold reservoir? (b) How much heat per cycle is discarded to the hot 
reservoir? 


Solution: 


a. 600 J; b. 800 J 
Exercise: 
Problem: 
During one cycle, a refrigerator removes 500 J from a cold reservoir 


and discharges 800 J to its hot reservoir. (a) What is its coefficient of 
performance? (b) How much work per cycle does it require to operate? 


Exercise: 
Problem: 
If a refrigerator discards 80 J of heat per cycle and its coefficient of 
performance is 6.0, what are (a) the quantity off heat it removes per 


cycle from a cold reservoir and (b) the amount of work per cycle 
required for its operation? 


Solution: 


a. 69 J; b. 11 J 
Exercise: 
Problem: 
A refrigerator has a coefficient of performance of 3.0. (a) If it requires 
200 J of work per cycle, how much heat per cycle does it remove the 


cold reservoir? (b) How much heat per cycle is discarded to the hot 
reservoir? 


Glossary 


coefficient of performance 
measure of effectiveness of a refrigerator or heat pump 


heat pump 
device that delivers heat to a hot reservoir 


refrigerator 
device that removes heat from a cold reservoir 


Statements of the Second Law of Thermodynamics 
By the end of this section, you will be able to: 


¢ Contrast the second law of thermodynamics statements according to 
Kelvin and Clausius formulations 
e Interpret the second of thermodynamics via irreversibility 


Earlier in this chapter, we introduced the Clausius statement of the second 
law of thermodynamics, which is based on the irreversibility of spontaneous 
heat flow. As we remarked then, the second law of thermodynamics can be 
stated in several different ways, and all of them can be shown to imply the 
others. In terms of heat engines, the second law of thermodynamics may be 
stated as follows: 


Note: 

Second Law of Thermodynamics (Kelvin statement) 

It is impossible to convert the heat from a single source into work without 
any other effect. 


This is known as the Kelvin statement of the second law of 
thermodynamics. This statement describes an unattainable “perfect 
engine,” as represented schematically in [link](a). Note that “without any 
other effect” is a very strong restriction. For example, an engine can absorb 
heat and turn it all into work, but not if it completes a cycle. Without 
completing a cycle, the substance in the engine is not in its original state 
and therefore an “other effect” has occurred. Another example is a chamber 
of gas that can absorb heat from a heat reservoir and do work isothermally 
against a piston as it expands. However, if the gas were returned to its initial 
State (that is, made to complete a cycle), it would have to be compressed 
and heat would have to be extracted from it. 


The Kelvin statement is a manifestation of a well-known engineering 
problem. Despite advancing technology, we are not able to build a heat 


engine that is 100% efficient. The first law does not exclude the possibility 
of constructing a perfect engine, but the second law forbids it. 


Th | Th | 
Perfect Q Perfect 
, Q 
heat engine refrigerator 
Bo WwW 
QOQ=W Q 
(a) (b) 


(a) A “perfect heat engine” converts all input heat into work. (b) A 
“perfect refrigerator” transports heat from a cold reservoir to a hot 
reservoir without work input. Neither of these devices is achievable in 
reality. 


We can show that the Kelvin statement is equivalent to the Clausius 
statement if we view the two objects in the Clausius statement as a cold 
reservoir and a hot reservoir. Thus, the Clausius statement becomes: It is 
impossible to construct a refrigerator that transfers heat from a cold 
reservoir to a hot reservoir without aid from an external source. The 
Clausius statement is related to the everyday observation that heat never 
flows spontaneously from a cold object to a hot object. Heat transfer in the 
direction of increasing temperature always requires some energy input. A 
“perfect refrigerator,” shown in [link](b), which works without such 
external aid, is impossible to construct. 


To prove the equivalence of the Kelvin and Clausius statements, we show 
that if one statement is false, it necessarily follows that the other statement 
is also false. Let us first assume that the Clausius statement is false, so that 
the perfect refrigerator of [link](b) does exist. The refrigerator removes heat 


Q from a cold reservoir at a temperature J; and transfers all of it to a hot 
reservoir at a temperature 7},. Now consider a real heat engine working in 
the same temperature range. It extracts heat 2 + AQ from the hot 
reservoir, does work W, and discards heat Q to the cold reservoir. From the 
first law, these quantities are related by W = (Q + AQ) —-Q = AQ. 


Suppose these two devices are combined as shown in [link]. The net heat 
removed from the hot reservoir is AQ, no net heat transfer occurs to or 
from the cold reservoir, and work W is done on some external body. Since 
W = AQ, the combination of a perfect refrigerator and a real heat engine 
is itself a perfect heat engine, thereby contradicting the Kelvin statement. 
Thus, if the Clausius statement is false, the Kelvin statement must also be 
false. 


Combining a perfect refrigerator and a real heat 


engine yields a perfect heat engine because 
W=AQ. 


Using the second law of thermodynamics, we now prove two important 
properties of heat engines operating between two heat reservoirs. The first 
property is that any reversible engine operating between two reservoirs has 
a greater efficiency than any irreversible engine operating between the 
same two reservoirs. 


The second property to be demonstrated is that all reversible engines 
operating between the same two reservoirs have the same efficiency. To 
show this, we start with the two engines D and E of [link](a), which are 
operating between two common heat reservoirs at temperatures T}, and T¢. 
First, we assume that D is a reversible engine and that E is a hypothetical 
irreversible engine that has a higher efficiency than D. If both engines 
perform the same amount of work W per cycle, it follows from [link] that 


Qn > Qj. It then follows from the first law that Q, > Qt. 
% E 


Qh On 


(a) 


(a) Two uncoupled engines D and E working between the same 
reservoirs. (b) The coupled engines, with D working in reverse. 


Suppose the cycle of D is reversed so that it operates as a refrigerator, and 
the two engines are coupled such that the work output of E is used to drive 
D, as shown in [link](b). Since Qy > Qj}, and Q, > QI, the net result of 
each cycle is equivalent to a spontaneous transfer of heat from the cold 
reservoir to the hot reservoir, a process the second law does not allow. The 
original assumption must therefore be wrong, and it is impossible to 
construct an irreversible engine such that E is more efficient than the 
reversible engine D. 


Now it is quite easy to demonstrate that the efficiencies of all reversible 
engines operating between the same reservoirs are equal. Suppose that D 
and E are both reversible engines. If they are coupled as shown in [link](b), 
the efficiency of E cannot be greater than the efficiency of D, or the second 
law would be violated. If both engines are then reversed, the same 


reasoning implies that the efficiency of D cannot be greater than the 
efficiency of E. Combining these results leads to the conclusion that all 
reversible engines working between the same two reservoirs have the same 
efficiency. 


Note: 
Exercise: 


Problem: 
Check Your Understanding What is the efficiency of a perfect heat 


engine? What is the coefficient of performance of a perfect 
refrigerator? 


Solution: 


A perfect heat engine would have Q,. = 0, which would lead to 
e=1-—Q,./Qn = 1. A perfect refrigerator would need zero work, 
that is, W = 0, which leads to Kp = Q,./W — oo. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Show that Qn — Q), = Q. — Q, for the 
hypothetical engine of [link](b). 


Solution: 


From the engine on the right, we have W = Q;, — Q\.. From the 
refrigerator on the right, we have Q, = Q, + W. Thus, 
W = Q,- QM = Qn - Qe. 


Summary 


e The Kelvin statement of the second law of thermodynamics: It is 
impossible to convert the heat from a single source into work without 
any other effect. 

e The Kelvin statement and Clausius statement of the second law of 
thermodynamics are equivalent. 


Conceptual Questions 


Exercise: 


Problem: 


In the text, we showed that if the Clausius statement is false, the 
Kelvin statement must also be false. Now show the reverse, such that 
if the Kelvin statement is false, it follows that the Clausius statement is 
false. 


Solution: 


If we combine a perfect engine and a real refrigerator with the engine 
converting heat Q from the hot reservoir into work W = Q to drive 
the refrigerator, then the heat dumped to the hot reservoir by the 
refrigerator will be W + AQ, resulting in a perfect refrigerator 
transferring heat AQ from the cold reservoir to hot reservoir without 
any other effect. 


Exercise: 
Problem: 
Why don’t we operate ocean liners by extracting heat from the ocean 
or operate airplanes by extracting heat from the atmosphere? 


Exercise: 


Problem: 


Discuss the practical advantages and disadvantages of heat pumps and 
electric heating. 


Solution: 


Heat pumps can efficiently extract heat from the ground to heat on 
cooler days or pull heat out of the house on warmer days. The 
disadvantage of heat pumps are that they are more costly than 
alternatives, require maintenance, and will not work efficiently when 
temperature differences between the inside and outside are very large. 
Electric heating is much cheaper to purchase than a heat pump; 
however, it may be more costly to run depending on the electric rates 
and amount of usage. 


Exercise: 
Problem: 
The energy output of a heat pump is greater than the energy used to 


operate the pump. Why doesn’t this statement violate the first law of 
thermodynamics? 


Exercise: 
Problem: 


Speculate as to why nuclear power plants are less efficient than fossil- 
fuel plants based on temperature arguments. 


Solution: 


A nuclear reactor needs to have a lower temperature to operate, so its 
efficiency will not be as great as a fossil-fuel plant. This argument 
does not take into consideration the amount of energy per reaction: 
Nuclear power has a far greater energy output than fossil fuels. 


Exercise: 


Problem: 


An ideal gas goes from state (p;, V;) to state (p-, Ve) when it is 
allowed to expand freely. Is it possible to represent the actual process 
on a pV diagram? Explain. 


Glossary 


Kelvin statement of the second law of thermodynamics 
it is impossible to convert the heat from a single source into work 
without any other effect 


perfect engine 
engine that can convert heat into work with 100% efficiency 


perfect refrigerator (heat pump) 
refrigerator (heat pump) that can remove (dump) heat without any 
input of work 


The Carnot Cycle 


e Describe the Carnot cycle with the roles of all four processes involved 

¢ Outline the Carnot principle and its implications 

e Demonstrate the equivalence of the Carnot principle and the second law of 
thermodynamics 


In the early 1820s, Sadi Carnot (1786-1832), a French engineer, became 
interested in improving the efficiencies of practical heat engines. In 1824, his 
studies led him to propose a hypothetical working cycle with the highest possible 
efficiency between the same two reservoirs, known now as the Carnot cycle. An 
engine operating in this cycle is called a Carnot engine. The Carnot cycle is of 
special importance for a variety of reasons. At a practical level, this cycle 
represents a reversible model for the steam power plant and the refrigerator or 
heat pump. Yet, it is also very important theoretically, for it plays a major role in 
the development of another important statement of the second law of 
thermodynamics. Finally, because only two reservoirs are involved in its 
operation, it can be used along with the second law of thermodynamics to define 
an absolute temperature scale that is truly independent of any substance used for 
temperature Measurement. 


With an ideal gas as the working substance, the steps of the Carnot cycle, as 
represented by [link], are as follows. 


1. Isothermal expansion. The gas is placed in thermal contact with a heat 
reservoir at a temperature 7},. The gas absorbs heat @, from the heat 
reservoir and is allowed to expand isothermally, doing work W,. Because 
the internal energy F,; of an ideal gas is a function of the temperature only, 
the change of the internal energy is zero, that is, AFin, = 0 during this 
isothermal expansion. With the first law of thermodynamics, 

AEin, = Q — W, we find that the heat absorbed by the gas is 
Equation: 


V, 
Qn = W, = nRT} In—. 
Vu 


(Temperature 7,,) (Temperature T.) 


Step 1: Step 2: Step 3: Step 4: 
Isothermal Adiabatic Isothermal Adiabatic 
expansion expansion compression compression 

(MN) (NO) (O-P) (PM) 


The four processes of the Carnot cycle. The working substance is 
assumed to be an ideal gas whose thermodynamic path MNOP is 
represented in [link]. 


The total work done by the gas in the Carnot cycle is 


shown and given by the area enclosed by the loop 
MNOPM. 


2. Adiabatic expansion. The gas is thermally isolated and allowed to expand 
further, doing work W3. Because this expansion is adiabatic, the temperature 
of the gas falls—in this case, from Tj, to J.. From pV’ = constant and the 
equation of state for an ideal gas, pV = n RT’, we have 
Equation: 


TV’~! = constant, 


so that 
Equation: 


ThVy 1 = T.Vot. 


3. Isothermal compression. The gas is placed in thermal contact with a cold 
reservoir at temperature J, and compressed isothermally. During this 
process, work W3 is done on the gas and it gives up heat Q, to the cold 
reservoir. The reasoning used in step 1 now yields 
Equation: 


Vo 
Cc = RYT, ] foot 9 
OLS n n Vp 


where Q, is the heat dumped to the cold reservoir by the gas. 

4. Adiabatic compression. The gas is thermally isolated and returned to its 
initial state by compression. In this process, work W, is done on the gas. 
Because the compression is adiabatic, the temperature of the gas rises— 
from TJ. to Tj), in this particular case. The reasoning of step 2 now gives 
Equation: 


T.Vp)! = TV !. 


The total work done by the gas in the Carnot cycle is given by 
Equation: 


W = W, + We — W3 — W4. 


This work is equal to the area enclosed by the loop shown in the pV diagram of 
[link]. Because the initial and final states of the system are the same, the change 
of the internal energy of the gas in the cycle must be zero, that is, AFing = 0. 
The first law of thermodynamics then gives 

Equation: 


W = Q —- AEF = (Qn — Q-) — 0, 


and 
Equation: 


W = Qin- Qe. 


To find the efficiency of this engine, we first divide Q. by Qn : 
Equation: 
Q. se InVo/ Vp 


When the adiabatic constant from step 2 is divided by that of step 4, we find 
Equation: 


Substituting this into the equation for Q./Qy, we obtain 
Equation: 


Finally, with [link], we find that the efficiency of this ideal gas Carnot engine is 
given by 


Note: 
Equation: 


An engine does not necessarily have to follow a Carnot engine cycle. All engines, 
however, have the same net effect, namely the absorption of heat from a hot 
reservoir, the production of work, and the discarding of heat to a cold reservoir. 
This leads us to ask: Do all reversible cycles operating between the same two 
reservoirs have the same efficiency? The answer to this question comes from the 
second law of thermodynamics discussed earlier: All reversible engine cycles 
produce exactly the same efficiency. Also, as you might expect, all real engines 
operating between two reservoirs are less efficient than reversible engines 
operating between the same two reservoirs. This too is a consequence of the 
second law of thermodynamics shown earlier. 


The cycle of an ideal gas Carnot refrigerator is represented by the pV diagram of 
[link]. It is a Carnot engine operating in reverse. The refrigerator extracts heat Q. 
from a cold-temperature reservoir at T, when the ideal gas expands isothermally. 
The gas is then compressed adiabatically until its temperature reaches 7}, after 
which an isothermal compression of the gas results in heat Q, being discarded to 
a high-temperature reservoir at 7}. Finally, the cycle is completed by an adiabatic 
expansion of the gas, causing its temperature to drop to T,. 


The work done on the gas in one cycle of the Carnot 
refrigerator is shown and given by the area enclosed by 
the loop MPONM. 


The work done on the ideal gas is equal to the area enclosed by the path of the pV 
diagram. From the first law, this work is given by 
Equation: 


W= Qh _ Q.. 
An analysis just like the analysis done for the Carnot engine gives 
Equation: 


Q. Qn 


T°  T. 


When combined with [link], this yields 


Note: 
Equation: 


for the coefficient of performance of the ideal-gas Carnot refrigerator. Similarly, 
we can work out the coefficient of performance for a Carnot heat pump as 


Note: 
Equation: 


Qn Th 


Kos as : 
Qn —- Qe iis Ih 


We have just found equations representing the efficiency of a Carnot engine and 
the coefficient of performance of a Carnot refrigerator or a Carnot heat pump, 
assuming an ideal gas for the working substance in both devices. However, these 
equations are more general than their derivations imply. We will soon show that 
they are both valid no matter what the working substance is. 


Carnot summarized his study of the Carnot engine and Carnot cycle into what is 
now known as Carnot’s principle: 


Note: 

Carnot’s Principle 

No engine working between two reservoirs at constant temperatures can have a 
greater efficiency than a reversible engine. 


This principle can be viewed as another statement of the second law of 
thermodynamics and can be shown to be equivalent to the Kelvin statement and 
the Clausius statement. 


Example: 

The Carnot Engine 

A Carnot engine has an efficiency of 0.60 and the temperature of its cold 
reservoir is 300 K. (a) What is the temperature of the hot reservoir? (b) If the 
engine does 300 J of work per cycle, how much heat is removed from the high- 
temperature reservoir per cycle? (c) How much heat is exhausted to the low- 
temperature reservoir per cycle? 

Strategy 

From the temperature dependence of the thermal efficiency of the Carnot engine, 
we can find the temperature of the hot reservoir. Then, from the definition of the 
efficiency, we can find the heat removed when the work done by the engine is 
given. Finally, energy conservation will lead to how much heat must be dumped 
to the cold reservoir. 

Solution 


a. From e = 1 — T,/Th we have 
Equation: 


so that the temperature of the hot reservoir is 
Equation: 


300 K 


4 peeeee toasted 
ned 


= Ge 


b. By definition, the efficiency of the engine is e = W/Q, so that the heat 
removed from the high-temperature reservoir per cycle is 
Equation: 


W  300J 
ad eee 51114 f 
oie = 5060 


c. From the first law, the heat exhausted to the low-temperature reservoir per 
cycle by the engine is 
Equation: 


Oo, = WV —o00 500 OU, 


Significance 

A Carnot engine has the maximum possible efficiency of converting heat into 
work between two reservoirs, but this does not necessarily mean it is 100% 
efficient. As the difference in temperatures of the hot and cold reservoir 
increases, the efficiency of a Carnot engine increases. 


Example: 

A Carnot Heat Pump 

Imagine a Carnot heat pump operates between an outside temperature of 0 °C 
and an inside temperature of 20.0 “C. What is the work needed if the heat 
delivered to the inside of the house is 30.0 kJ? 

Strategy 

Because the heat pump is assumed to be a Carnot pump, its performance 
coefficient is given by Kp = Q,/W = T),/(Th — T-). Thus, we can find the 
work W from the heat delivered Qp. 

Solution 

The work needed is obtained from 

Equation: 


Significance 

We note that this work depends not only on the heat delivered to the house but 
also on the temperatures outside and inside. The dependence on the temperature 
outside makes them impractical to use in areas where the temperature is much 
colder outside than room temperature. 


In terms of energy costs, the heat pump is a very economical means for heating 
buildings ({Llink]). Contrast this method with turning electrical energy directly 
into heat with resistive heating elements. In this case, one unit of electrical energy 
furnishes at most only one unit of heat. Unfortunately, heat pumps have problems 
that do limit their usefulness. They are quite expensive to purchase compared to 
resistive heating elements, and, as the performance coefficient for a Carnot heat 
pump shows, they become less effective as the outside temperature decreases. In 
fact, below about —10 °C, the heat they furnish is less than the energy used to 
operate them. 


A photograph of a heat pump (large box) located 
outside a house. This heat pump is located in a warm 
climate area, like the southern United States, since it 

would be far too inefficient located in the northern 
half of the United States. (credit: modification of 
work by Peter Stevens) 


Note: 
Exercise: 


Problem: 


Check Your Understanding A Carnot engine operates between reservoirs 
at 400 °C and 30 °C. (a) What is the efficiency of the engine? (b) If the 
engine does 5.0 J of work per cycle, how much heat per cycle does it absorb 
from the high-temperature reservoir? (c) How much heat per cycle does it 
exhaust to the cold-temperature reservoir? (d) What temperatures at the 
cold reservoir would give the minimum and maximum efficiency? 


Solution: 


aes deh (oh Orel. — Oa e: 
Oe= Oy We = 2 8 Oran 00e © 


Note: 
Exercise: 


Problem: 

Check Your Understanding A Carnot refrigerator operates between two 
heat reservoirs whose temperatures are 0 “C and 25 °C. (a) What is the 
coefficient of performance of the refrigerator? (b) If 200 J of work are done 
on the working substance per cycle, how much heat per cycle is extracted 


from the cold reservoir? (c) How much heat per cycle is discarded to the 
hot reservoir? 


Solution: 


a. Kp = T./(Th — T.) = 10.9; b. Qo = KRW = 2.18kJ; c. 
On = Q. + W =2.38k3 


Summary 


e The Carnot cycle is the most efficient engine for a reversible cycle designed 
between two reservoirs. 

¢ The Carnot principle is another way of stating the second law of 
thermodynamics. 


Conceptual Questions 


Exercise: 


Problem: 


To increase the efficiency of a Carnot engine, should the temperature of the 
hot reservoir be raised or lowered? What about the cold reservoir? 


Solution: 


In order to increase the efficiency, the temperature of the hot reservoir 
should be raised, and the cold reservoir should be lowered as much as 
possible. This can be seen in [link]. 


Exercise: 


Problem: How could you design a Carnot engine with 100% efficiency? 


Exercise: 


Problem: What type of processes occur in a Carnot cycle? 
Solution: 


adiabatic and isothermal processes 


Problems 


Exercise: 
Problem: 
The temperature of the cold and hot reservoirs between which a Carnot 


refrigerator operates are —73 °C and 270 °C, respectively. Which is its 
coefficient of performance? 


Solution: 


2.0 
Exercise: 
Problem: 
Suppose a Carnot refrigerator operates between TJ, and 7}. Calculate the 
amount of work required to extract 1.0 J of heat from the cold reservoir if (a) 


Ta 7 OH 97 C WT SC 727°C) = = 173 'C 
T= OF Com GT. 98 CT. 27 C. 


Exercise: 


Problem: 


A Carnot engine operates between reservoirs at 600 and 300 K. If the engine 
absorbs 100 J per cycle at the hot reservoir, what is its work output per 
cycle? 


Solution: 


ou! 
Exercise: 
Problem: 
A 500-W motor operates a Carnot refrigerator between —5 °C and 30 °C. 


(a) What is the amount of heat per second extracted from the inside of the 
refrigerator? (b) How much heat is exhausted to the outside air per second? 


Exercise: 


Problem: Sketch a Carnot cycle on a temperature-volume diagram. 


Solution: 


lsotherms 


Adiabatic curves 


Exercise: 
Problem: 
A Carnot heat pump operates between 0 °C and 20 °C. How much heat is 


exhausted into the interior of a house for every 1.0 J of work done by the 
pump? 


Exercise: 
Problem: 
An engine operating between heat reservoirs at 20 “C and 200 °C extracts 
1000 J per cycle from the hot reservoir. (a) What is the maximum possible 


work that engine can do per cycle? (b) For this maximum work, how much 
heat is exhausted to the cold reservoir per cycle? 


Solution: 


a. 381 J; b. 619 J 
Exercise: 
Problem: 
Suppose a Carnot engine can be operated between two reservoirs as either a 


heat engine or a refrigerator. How is the coefficient of performance of the 
refrigerator related to the efficiency of the heat engine? 


Exercise: 


Problem: 


A Carnot engine is used to measure the temperature of a heat reservoir. The 
engine operates between the heat reservoir and a reservoir consisting of 
water at its triple point. (a) If 400 J per cycle are removed from the heat 
reservoir while 200 J per cycle are deposited in the triple-point reservoir, 
what is the temperature of the heat reservoir? (b) If 400 J per cycle are 
removed from the triple-point reservoir while 200 J per cycle are deposited 
in the heat reservoir, what is the temperature of the heat reservoir? 


Solution: 


a. 546 K; b. 137 K 


Exercise: 


Problem: 


What is the minimum work required of a refrigerator if it is to extract 50 J 
per cycle from the inside of a freezer at —10 °C and exhaust heat to the air 
at 25 “C? 


Glossary 


Carnot cycle 
cycle that consists of two isotherms at the temperatures of two reservoirs 
and two adiabatic processes connecting the isotherms 


Carnot engine 
Carnot heat engine, refrigerator, or heat pump that operates on a Carnot 
cycle 


Carnot principle 
principle governing the efficiency or performance of a heat device operating 
on a Carnot cycle: any reversible heat device working between two 
reservoirs must have the same efficiency or performance coefficient, greater 
than that of an irreversible heat device operating between the same two 
reservoirs 


Entropy 
By the end of this section you will be able to: 


e Describe the meaning of entropy 
¢ Calculate the change of entropy for some simple processes 


The second law of thermodynamics is best expressed in terms of a change 
in the thermodynamic variable known as entropy, which is represented by 
the symbol S. Entropy, like internal energy, is a state function. This means 
that when a system makes a transition from one state into another, the 
change in entropy ASS is independent of path and depends only on the 
thermodynamic variables of the two states. 


We first consider AS for a system undergoing a reversible process at a 
constant temperature. In this case, the change in entropy of the system is 
given by 


Note: 
Equation: 


where Q is the heat exchanged by the system kept at a temperature T (in 
kelvin). If the system absorbs heat—that is, with Q > O0O—the entropy of the 
system increases. As an example, suppose a gas is kept at a constant 
temperature of 300 K while it absorbs 10 J of heat in a reversible process. 
Then from [link], the entropy change of the gas is 
Equation: 

10 J 


S = —— = 0.033 J/K. 
300 K / 


Similarly, if the gas loses 5.0 J of heat; that is, Q = —5.0 J, at temperature 
T' = 200 K, we have the entropy change of the system given by 
Equation: 


_ -5.0I 


= —0.025 J/K. 
200 K / 


Example: 

Entropy Change of Melting Ice 

Heat is slowly added to a 50-g chunk of ice at 0 °C until it completely 
melts into water at the same temperature. What is the entropy change of the 
ice? 

Strategy 

Because the process is slow, we can approximate it as a reversible process. 
The temperature is a constant, and we can therefore use [link] in the 
calculation. 

Solution 

The ice is melted by the addition of heat: 

Equation: 


Q=mL;=50g x 335 J/g = 16.8kJ. 


In this reversible process, the temperature of the ice-water mixture is fixed 
at 0 °C or 273 K. Now from AS = Q/T, the entropy change of the ice is 
Equation: 


— 16.8kJ 
873K 


AS = 61.53/K 


when it melts to water at 0 °C. 

Significance 

During a phase change, the temperature is constant, allowing us to use 
[link] to solve this problem. The same equation could also be used if we 
changed from a liquid to a gas phase, since the temperature does not 
change during that process either. 


The change in entropy of a system for an arbitrary, reversible transition for 
which the temperature is not necessarily constant is defined by modifying 
AS = Q/T. Imagine a system making a transition from state A to B in 
small, discrete steps. The temperatures associated with these states are T', 
and T’p, respectively. During each step of the transition, the system 
exchanges heat AQ; reversibly at a temperature T;. This can be 
accomplished experimentally by placing the system in thermal contact with 
a large number of heat reservoirs of varying temperatures 77, as illustrated 
in [link]. The change in entropy for each step is AS; = Q;/T;. The net 
change in entropy of the system for the transition is 

Equation: 


AS =Sp-Ss=)° AS;=S— a, 


We now take the limit as AQ; — 0, and the number of steps approaches 
infinity. Then, replacing the summation by an integral, we obtain 


Note: 
Equation: 


where the integral is taken between the initial state A and the final state B. 
This equation is valid only if the transition from A to B is reversible. 


The gas expands at constant pressure as its temperature is increased in 
small steps through the use of a series of heat reservoirs. 


As an example, let us determine the net entropy change of a reversible 
engine while it undergoes a single Carnot cycle. In the adiabatic steps 2 and 
4 of the cycle shown in [link], no heat exchange takes place, so 


ASo=A6i= / dQ/T = 0. In step 1, the engine absorbs heat Q,, at a 


temperature Tj, so its entropy change is AS, = Q;,/Th. Similarly, in step 
3, AS3 = —Q,/T-. The net entropy change of the engine in one cycle of 
operation is then 

Equation: 


AS = AS) + ASy + AS3 + AS, = Qh — Qe 
Th Te 


However, we know that for a Carnot engine, 
Equation: 
Qn _ Qe 
Th I. 


SO 


Equation: 


ASz = 0. 


There is no net change in the entropy of the Carnot engine over a complete 
cycle. Although this result was obtained for a particular case, its validity 
can be shown to be far more general: There is no net change in the entropy 
of a system undergoing any complete reversible cyclic process. 
Mathematically, we write this statement as 


Note: 
Equation: 


where f represents the integral over a closed reversible path. 


We can use [link] to show that the entropy change of a system undergoing a 
reversible process between two given states is path independent. An 
arbitrary, closed path for a reversible cycle that passes through the states A 


and B is shown in [link]. From [link], § dS = 0 for this closed path. We 


may split this integral into two segments, one along I, which leads from A 
to B, the other along II, which leads from B to A. Then 
Equation: 


Since the process is reversible, 
Equation: 


The closed loop passing through 
states A and B represents a reversible 
cycle. 


Hence, the entropy change in going from A to B is the same for paths I and 
II. Since paths I and II are arbitrary, reversible paths, the entropy change in 
a transition between two equilibrium states is the same for all the reversible 
processes joining these states. Entropy, like internal energy, is therefore a 
state function. 


What happens if the process is irreversible? When the process is 
irreversible, we expect the entropy of a closed system, or the system and its 


environment (the universe), to increase. Therefore we can rewrite this 
expression aS 


Note: 
Equation: 


AS > 0, 


where S is the total entropy of the closed system or the entire universe, and 
the equal sign is for a reversible process. The fact is the entropy statement 
of the second law of thermodynamics: 


Note: 
Second Law of Thermodynamics (Entropy statement) 
The entropy of a closed system and the entire universe never decreases. 


We can show that this statement is consistent with the Kelvin statement, the 
Clausius statement, and the Carnot principle. 


Example: 

Entropy Change of a System during an Isobaric Process 

Determine the entropy change of an object of mass m and specific heat c 
that is cooled rapidly (and irreversibly) at constant pressure from 7}, to T,. 
Strategy 

The process is clearly stated as an irreversible process; therefore, we 
cannot simply calculate the entropy change from the actual process. 
However, because entropy of a system is a function of state, we can 
imagine a reversible process that starts from the same initial state and ends 


at the given final state. Then, the entropy change of the system is given by 


B 
[link], AS = / dQ/T. 
A 


Solution 

To replace this rapid cooling with a process that proceeds reversibly, we 
imagine that the hot object is put into thermal contact with successively 
cooler heat reservoirs whose temperatures range from 7}, to T¢. 
Throughout the substitute transition, the object loses infinitesimal amounts 
of heat dQ, so we have 

Equation: 


ee dQ 
ie 


(Ao) = 


From the definition of heat capacity, an infinitesimal exchange dQ for the 
object is related to its temperature change dT by 
Equation: 


dQ = mcdT. 


Substituting this dQ into the expression for AS, we obtain the entropy 
change of the object as it is cooled at constant pressure from 7}, to TJ, : 
Equation: 


Te 18 ie 
AS = ee = mcln—. 
7 - a 


Note that AS’ < 0 here because T, < Ty. In other words, the object has 
lost some entropy. But if we count whatever is used to remove the heat 
from the object, we would still end up with A Syniverse > 0 because the 
process is irreversible. 

Significance 

If the temperature changes during the heat flow, you must keep it inside the 
integral to solve for the change in entropy. If, however, the temperature is 
constant, you can simply calculate the entropy change as the heat flow 
divided by the temperature. 


Example: 

Stirling Engine 

The steps of a reversible Stirling engine are as follows. For this problem, 
we will use 0.0010 mol of a monatomic gas that starts at a temperature of 
133 °C and a volume of 0.10 m®°, which will be called point A. Then it 
goes through the following steps: 


1. Step AB: isothermal expansion at 133 °C from 0.10 m?® to 0.20 m® 
2. Step BC: isochoric cooling to 33 “C 

3. Step CD: isothermal compression at 33 °C from 0.20 m® to 0.10 m? 
4. Step DA: isochoric heating back to 133 °C and 0.10 m? 


(a) Draw the pV diagram for the Stirling engine with proper labels. 
(b) Fill in the following table. 


Step W (J) Q (J) AS (J/K) 
Step AB 
Step BC 
Step CD 
Step DA 


Complete cycle 


(c) How does the efficiency of the Stirling engine compare to the Carnot 
engine working within the same two heat reservoirs? 

Strategy 

Using the ideal gas law, calculate the pressure at each point so that they can 
be labeled on the pV diagram. Isothermal work is calculated using 


W =n? in (#) , and an isochoric process has no work done. The heat 


flow is calculated from the first law of thermodynamics, Q = AFing — W 
where AF;,4 = $nRAT for monatomic gasses. Isothermal steps have a 
change in entropy of Q/T, whereas isochoric steps have 


KS 2nR In (4) . The efficiency of a heat engine is calculated by 


using €gtir = W/Qn. 
Solution 


a. The graph is shown below. 


p (atm) 


34 


26 


0.10 0.20 Vv (m?) 


b. The completed table is shown below. 


Step W (J) Q (J) AS (J/K) 


Step AB Isotherm 23 25 0.0057 
Step BC Isochoric 0 —1.2 0.0035 
Step CD Isotherm —1.8 —1.8 —0.0059 
Step DA Isochoric 0 2 —0.0035 
Complete cycle O33 0.5 AA) 


c. The efficiency of the Stirling heat engine is 
Equation: 


esr — On — (One + Oop) (Qup 4 pa) — 00/20 — 0d) 


If this were a Carnot engine operating between the same heat reservoirs, its 
efficiency would be 
Equation: 


Te 
ee ie (F) — 0.25. 
Th 


Therefore, the Carnot engine would have a greater efficiency than the 
Stirling engine. 

Significance 

In the early days of steam engines, accidents would occur due to the high 
pressure of the steam in the boiler. Robert Stirling developed an engine in 
1816 that did not use steam and therefore was safer. The Stirling engine 
was commonly used in the nineteenth century, but developments in steam 
and internal combustion engines have made it difficult to broaden the use 
of the Stirling engine. 

The Stirling engine uses compressed air as the working substance, which 
passes back and forth between two chambers with a porous plug, called the 


regenerator, which is made of material that does not conduct heat as well. 
In two of the steps, pistons in the two chambers move in phase. 


Summary 


e The change in entropy for a reversible process at constant temperature 
is equal to the heat divided by the temperature. The entropy change of 


B 
a system under a reversible process is given by AS = / dQ /T. 
A 


e A system’s change in entropy between two states is independent of the 
reversible thermodynamic path taken by the system when it makes a 
transition between the states. 


Conceptual Questions 


Exercise: 


Problem: 


Does the entropy increase for a Carnot engine for each cycle? 
Exercise: 


Problem: 


Is it possible for a system to have an entropy change if it neither 
absorbs nor emits heat during a reversible transition? What happens if 
the process is irreversible? 


Solution: 


Entropy will not change if it is a reversible transition but will change if 
the process is irreversible. 


Problems 


Exercise: 
Problem: 


Two hundred joules of heat are removed from a heat reservoir at a 
temperature of 200 K. What is the entropy change of the reservoir? 


Solution: 


—1 J/K 
Exercise: 
Problem: 
In an isothermal reversible expansion at 27 °C, an ideal gas does 20 J 
of work. What is the entropy change of the gas? 
Exercise: 
Problem: 


An ideal gas at 300 K is compressed isothermally to one-fifth its 
original volume. Determine the entropy change per mole of the gas. 


Solution: 


—13 J(K mole) 
Exercise: 
Problem: 
What is the entropy change of 10 g of steam at 100 “C when it 
condenses to water at the same temperature? 


Exercise: 


Problem: 


A metal rod is used to conduct heat between two reservoirs at 
temperatures 7}, and 7’, respectively. When an amount of heat Q 
flows through the rod from the hot to the cold reservoir, what is the net 
entropy change of the rod, the hot reservoir, the cold reservoir, and the 
universe? 


Solution: 
Q Q 1 1 
Be Ceee 
Exercise: 
Problem: 


For the Carnot cycle of [link], what is the entropy change of the hot 
reservoir, the cold reservoir, and the universe? 


Exercise: 
Problem: 
A 5.0-kg piece of lead at a temperature of 600 °C is placed in a lake 


whose temperature is 15 “C. Determine the entropy change of (a) the 
lead piece, (b) the lake, and (c) the universe. 


Solution: 


a. —709 J/K; b. 1300 J/K; c. 591 J/K 
Exercise: 
Problem: 
One mole of an ideal gas doubles its volume in a reversible isothermal 


expansion. (a) What is the change in entropy of the gas? (b) If 1500 J 
of heat are added in this process, what is the temperature of the gas? 


Exercise: 


Problem: 


One mole of an ideal monatomic gas is confined to a rigid container. 
When heat is added reversibly to the gas, its temperature changes from 
T, to Ty. (a) How much heat is added? (b) What is the change in 
entropy of the gas? 


Solution: 


a =—nhaATsbs =n in(1>/7;) 
Exercise: 


Problem: 


(a) A 5.0-kg rock at a temperature of 20 °C is dropped into a shallow 
lake also at 20 °C from a height of 1.0 x 10° m. What is the 
resulting change in entropy of the universe? (b) If the temperature of 
the rock is 100 °“C when it is dropped, what is the change of entropy of 
the universe? Assume that air friction is negligible (not a good 
assumption) and that c = 860 J/kg - K is the specific heat of the rock. 


Glossary 


entropy 
state function of the system that changes when heat is transferred 
between the system and the environment 


entropy statement of the second law of thermodynamics 
entropy of a closed system or the entire universe never decreases 


Entropy on a Microscopic Scale 
By the end of this section you will be able to: 


e Interpret the meaning of entropy at a microscopic scale 

¢ Calculate a change in entropy for an irreversible process of a system 
and contrast with the change in entropy of the universe 

e Explain the third law of thermodynamics 


We have seen how entropy is related to heat exchange at a particular 
temperature. In this section, we consider entropy from a statistical 
viewpoint. Although the details of the argument are beyond the scope of 
this textbook, it turns out that entropy can be related to how disordered or 
randomized a system is—the more it is disordered, the higher is its entropy. 
For example, a new deck of cards is very ordered, as the cards are arranged 
numerically by suit. In shuffling this new deck, we randomize the 
arrangement of the cards and therefore increase its entropy ([link]). Thus, 
by picking one card off the top of the deck, there would be no indication of 
what the next selected card will be. 


The entropy of a new deck of cards goes up after the 
dealer shuffles them. (credit: “Rommel 
SK”/YouTube) 


The second law of thermodynamics requires that the entropy of the universe 
increase in any irreversible process. Thus, in terms of order, the second law 
may be stated as follows: 


In any irreversible process, the universe becomes more disordered. For 
example, the irreversible free expansion of an ideal gas, shown in [link], 
results in a larger volume for the gas molecules to occupy. A larger volume 
means more possible arrangements for the same number of atoms, so 
disorder is also increased. As a result, the entropy of the gas has gone up. 
The gas in this case is a closed system, and the process is irreversible. 
Changes in phase also illustrate the connection between entropy and 
disorder. 


Example: 

Entropy Change of the Universe 

Suppose we place 50 g of ice at 0 °C in contact with a heat reservoir at 

20 °C. Heat spontaneously flows from the reservoir to the ice, which melts 
and eventually reaches a temperature of 20 °C. Find the change in entropy 
of (a) the ice and (b) the universe. 

Strategy 

Because the entropy of a system is a function of its state, we can imagine 
two reversible processes for the ice: (1) ice is melted at 0 °C(T4); and (2) 
melted ice (water) is warmed up from 0 °C to 20 °C(T'g) under constant 
pressure. Then, we add the change in entropy of the reservoir when we 
calculate the change in entropy of the universe. 

Solution 


a. From [link], the increase in entropy of the ice is 
Equation: 


oe = AS; =f AS» 


B 
_ mby aT’ 
a +me | T 


= (22535 +50 x 4.19 x In3%) J/K 


=A Ob Se 
b. During this transition, the reservoir gives the ice an amount of heat 
equal to 
Equation: 


Q = mL++me(Tp — Ta) 
= 50 x (335 +419" < 20) J 
— 2.10 x 10° J. 


This leads to a change (decrease) in entropy of the reservoir: 
Equation: 


INS cneruoir = So = —71.7 J/K. 
TB 


The increase in entropy of the universe is therefore 
Equation: 


A Siuniverse = 76.3 J/K — 71.75/K = 4.63/K > 0. 


Significance 

The entropy of the universe therefore is greater than zero since the ice 
gains more entropy than the reservoir loses. If we considered only the 
phase change of the ice into water and not the temperature increase, the 
entropy change of the ice and reservoir would be the same, resulting in the 
universe gaining no entropy. 


This process also results in a more disordered universe. The ice changes 
from a solid with molecules located at specific sites to a liquid whose 
molecules are much freer to move. The molecular arrangement has 
therefore become more randomized. Although the change in average kinetic 
energy of the molecules of the heat reservoir is negligible, there is 
nevertheless a significant decrease in the entropy of the reservoir because it 
has many more molecules than the melted ice cube. However, the 
reservoir’s decrease in entropy is still not as large as the increase in entropy 
of the ice. The increased disorder of the ice more than compensates for the 
increased order of the reservoir, and the entropy of the universe increases by 
4.6 J/K. 


You might suspect that the growth of different forms of life might be a net 
ordering process and therefore a violation of the second law. After all, a 
single cell gathers molecules and eventually becomes a highly structured 
organism, such as a human being. However, this ordering process is more 
than compensated for by the disordering of the rest of the universe. The net 
result is an increase in entropy and an increase in the disorder of the 
universe. 


Note: 
Exercise: 


Problem: 


Check Your Understanding In [link], the spontaneous flow of heat 
from a hot object to a cold object results in a net increase in entropy 
of the universe. Discuss how this result can be related to an increase 
in disorder of the system. 


Solution: 


When heat flows from the reservoir to the ice, the internal (mainly 
kinetic) energy of the ice goes up, resulting in a higher average speed 
and thus an average greater position variance of the molecules in the 
ice. The reservoir does become more ordered, but due to its much 


larger amount of molecules, it does not offset the change in entropy in 
the system. 


The second law of thermodynamics makes clear that the entropy of the 
universe never decreases during any thermodynamic process. For any other 
thermodynamic system, when the process is reversible, the change of the 
entropy is given by AS = Q/T. But what happens if the temperature goes 
to zero, J’ — 0? It turns out this is not a question that can be answered by 
the second law. 


A fundamental issue still remains: Is it possible to cool a system all the way 
down to zero kelvin? We understand that the system must be at its lowest 
energy state because lowering temperature reduces the kinetic energy of the 
constituents in the system. What happens to the entropy of a system at the 
absolute zero temperature? It turns out the absolute zero temperature is not 
reachable—at least, not though a finite number of cooling steps. This is a 
statement of the third law of thermodynamics, whose proof requires 
quantum mechanics that we do not present here. In actual experiments, 
physicists have continuously pushed that limit downward, with the lowest 
temperature achieved at about 1 x 10 !° K ina low-temperature lab at the 
Helsinki University of Technology in 2008. 


Like the second law of thermodynamics, the third law of thermodynamics 
can be stated in different ways. One of the common statements of the third 
law of thermodynamics is: The absolute zero temperature cannot be 
reached through any finite number of cooling steps. 


In other words, the temperature of any given physical system must be finite, 
that is, TY’ > 0. This produces a very interesting question in physics: Do we 
know how a system would behave if it were at the absolute zero 
temperature? 


The reason a system is unable to reach 0 K is fundamental and requires 
quantum mechanics to fully understand its origin. But we can certainly ask 
what happens to the entropy of a system when we try to cool it down to 0 


K. Because the amount of heat that can be removed from the system 
becomes vanishingly small, we expect that the change in entropy of the 
system along an isotherm approaches zero, that is, 


Note: 
Equation: 


lim (AS), = 0. 


T-0 


This can be viewed as another statement of the third law, with all the 
isotherms becoming isentropic, or into a reversible ideal adiabat. We can 
put this expression in words: A system becomes perfectly ordered when its 
temperature approaches absolute zero and its entropy approaches its 
absolute minimum. 


The third law of thermodynamics puts another limit on what can be done 
when we look for energy resources. If there could be a reservoir at the 
absolute zero temperature, we could have engines with efficiency of 100%, 
which would, of course, violate the second law of thermodynamics. 


Example: 

Entropy Change of an Ideal Gas in Free Expansion 

An ideal gas occupies a partitioned volume V; inside a box whose walls 
are thermally insulating, as shown in [link](a). When the partition is 
removed, the gas expands and fills the entire volume V2 of the box, as 
shown in part (b). What is the entropy change of the universe (the system 
plus its environment)? 


(a) (b) 


The adiabatic free expansion of an ideal gas from volume Vj to 
volume V9. 


Strategy 

The adiabatic free expansion of an ideal gas is an irreversible process. 
There is no change in the internal energy (and hence temperature) of the 
gas in such an expansion because no work or heat transfer has happened. 
Thus, a convenient reversible path connecting the same two equilibrium 
States is a slow, isothermal expansion from Vj to V3. In this process, the 
gas could be expanding against a piston while in thermal contact with a 
heat reservoir, as in step 1 of the Carnot cycle. 

Solution 

Since the temperature is constant, the entropy change is given by 

AS = Q/T, where 

Equation: 


V2 
= pdV 
Vi 


because AF, = 0. Now, with the help of the ideal gas law, we have 
Equation: 


V. 
2 dV Vs 
Q=nR iL V nR ny 


so the change in entropy of the gas is 
Equation: 


Q V2 

AS T nRin V," 
Because V2 > Vj, AS is positive, and the entropy of the gas has gone up 
during the free expansion. 
Significance 
What about the environment? The walls of the container are thermally 
insulating, so no heat exchange takes place between the gas and its 
surroundings. The entropy of the environment is therefore constant during 
the expansion. The net entropy change of the universe is then simply the 
entropy change of the gas. Since this is positive, the entropy of the 
universe increases in the free expansion of the gas. 


Example: 

Entropy Change during Heat Transfer 

Heat flows from a steel object of mass 4.00 kg whose temperature is 400 K 
to an identical object at 300 K. Assuming that the objects are thermally 
isolated from the environment, what is the net entropy change of the 
universe after thermal equilibrium has been reached? 

Strategy 

Since the objects are identical, their common temperature at equilibrium is 
350 K. To calculate the entropy changes associated with their transitions, 
we substitute the irreversible process of the heat transfer by two isobaric, 
reversible processes, one for each of the two objects. The entropy change 
for each object is then given by AS = mcln(Tp/T4). 


Solution 
Using c = 450 J/kg - K, the specific heat of steel, we have for the hotter 
object 
Equation: 
ty me dT 
NS ge = mec nz 
Dy 


= (4.00 kg) (450 J/kg -K)n 30% = —240J/K. 


Similarly, the entropy change of the cooler object is 


Equation: 


350 K 
AS, = (4.00 kg)(450 J/kg - K) In——— = 277 J/K. 
c = (4.00 kg) (450 J/kg - K) In = 277 J/ 
The net entropy change of the two objects during the heat transfer is then 
Equation: 


AS;, + AS, = 375/K. 


Significance 

The objects are thermally isolated from the environment, so its entropy 
must remain constant. Thus, the entropy of the universe also increases by 
37 J/K. 


Note: 
Exercise: 


Problem: 


Check Your Understanding A quantity of heat Q is absorbed from a 
reservoir at a temperature TJ}, by a cooler reservoir at a temperature 
T... What is the entropy change of the hot reservoir, the cold reservoir, 
and the universe? 


Solution: 


—Q/Th; Ove and OG Re —2 Dae) 


Note: 
Exercise: 


Problem: 


Check Your Understanding A 50-g copper piece at a temperature of 
20 °C is placed into a large insulated vat of water at 100 °C. (a) What 
is the entropy change of the copper piece when it reaches thermal 
equilibrium with the water? (b) What is the entropy change of the 
water? (c) What is the entropy change of the universe? 


Solution: 


a. 4.71 J/K; b. —4.18 J/K; c. 0.53 J/K 


Note: 

View this site to learn about entropy and microstates. Start with a large 
barrier in the middle and 1000 molecules in only the left chamber. What is 
the total entropy of the system? Now remove the barrier and let the 
molecules travel from the left to the right hand side? What is the total 
entropy of the system now? Lastly, add heat and note what happens to the 
temperature. Did this increase entropy of the system? 


Summary 


e Entropy can be related to how disordered a system is—the more it is 
disordered, the higher is its entropy. In any irreversible process, the 
universe becomes more disordered. 

e According to the third law of thermodynamics, absolute zero 
temperature is unreachable. 


Key Equations 


Result of energy conservation 
Efficiency of a heat engine 


Coefficient of performance of a 
refrigerator 


Coefficient of performance of a 
heat pump 


Resulting efficiency of a Carnot 
cycle 


Performance coefficient of a 
reversible refrigerator 


Performance coefficient of a 
reversible heat pump 


Entropy of a system undergoing 
a reversible process at a constant 
temperature 


Change of entropy of a system 
under a reversible process 


Entropy of a system undergoing 
any complete reversible cyclic 
process 


Change of entropy of a closed 
system under an irreversible 
process 


Change in entropy of the system 
along an isotherm 


Ww Qn-Qe 
se w ~ a 
e=1— z 
KR= Tere 
Kp = ee 
AS=2 


Conceptual Questions 


Exercise: 
Problem: 
Are the entropy changes of the systems in the following processes 
positive or negative? (a) water vapor that condenses on a cold surface; 
(b) gas in a container that leaks into the surrounding atmosphere; (c) 
an ice cube that melts in a glass of lukewarm water; (d) the lukewarm 


water of part (c); (e) a real heat engine performing a cycle; (f) food 
cooled in a refrigerator. 


Exercise: 


Problem: 


Discuss the entropy changes in the systems of Question 21.10 in terms 
of disorder. 


Solution: 


Entropy is a function of disorder, so all the answers apply here as well. 


Problems 


Exercise: 
Problem: 
A copper rod of cross-sectional area 5.0 cm2 and length 5.0 m 
conducts heat from a heat reservoir at 373 K to one at 273 K. What is 
the time rate of change of the universe’s entropy for this process? 


Solution: 


3.78 x 10°? W/K 


Exercise: 
Problem: 
Fifty grams of water at 20 °C is heated until it becomes vapor at 
100 °C. Calculate the change in entropy of the water in this process. 
Exercise: 
Problem: 


Fifty grams of water at 0 “C are changed into vapor at 100 °C. What 
is the change in entropy of the water in this process? 


Solution: 


430 J/K 
Exercise: 
Problem: 
In an isochoric process, heat is added to 10 mol of monoatomic ideal 


gas whose temperature increases from 273 to 373 K. What is the 
entropy change of the gas? 


Exercise: 
Problem: 
Two hundred grams of water at 0 °C is brought into contact with a 
heat reservoir at 80 “C. After thermal equilibrium is reached, what is 
the temperature of the water? Of the reservoir? How much heat has 


been transferred in the process? What is the entropy change of the 
water? Of the reservoir? What is the entropy change of the universe? 


Solution: 


80 °C, 80 °C, 6.70 x 104 J, 215 J/K, -190 J/K, 25 J/K 


Exercise: 


Problem: 


Suppose that the temperature of the water in the previous problem is 
raised by first bringing it to thermal equilibrium with a reservoir at a 
temperature of 40 °C and then with a reservoir at 80 °C. Calculate the 
entropy changes of (a) each reservoir, (b) of the water, and (c) of the 
universe. 


Exercise: 
Problem: 
Two hundred grams of water at 0 “C is brought into contact into 
thermal equilibrium successively with reservoirs at 20 “C, 40 °C, 


60 °C, and 80 °C. (a) What is the entropy change of the water? (b) Of 
the reservoir? (c) What is the entropy change of the universe? 


Solution: 


ASy,0 = 215 J/K, ASg = —208 J/K, ASy =7J/K 
Exercise: 


Problem: 


(a) Ten grams of H2O starts as ice at 0 “C. The ice absorbs heat from 
the air (just above 0 °C) until all of it melts. Calculate the entropy 
change of the H2O, of the air, and of the universe. (b) Suppose that the 
air in part (a) is at 20 °C rather than 0 °C and that the ice absorbs heat 
until it becomes water at 20 “C. Calculate the entropy change of the 
HO, of the air, and of the universe. (c) Is either of these processes 
reversible? 


Exercise: 


Problem: 


The Carnot cycle is represented by the temperature-entropy diagram 
shown below. (a) How much heat is absorbed per cycle at the high- 
temperature reservoir? (b) How much heat is exhausted per cycle at the 
low-temperature reservoir? (c) How much work is done per cycle by 
the engine? (d) What is the efficiency of the engine? 


T(K) 


600 


300 


Solution: 


a. 1200 J; b. 600 J; c. 600 J; d. 0.50 
Exercise: 
Problem: 
A Carnot engine operating between heat reservoirs at 500 and 300 K 
absorbs 1500 J per cycle at the high-temperature reservoir. (a) 


Represent the engine’s cycle on a temperature-entropy diagram. (b) 
How much work per cycle is done by the engine? 


Exercise: 


Problem: 


A monoatomic ideal gas (n moles) goes through a cyclic process 
shown below. Find the change in entropy of the gas in each step and 
the total entropy change over the entire cycle. 


p 


Adiabat 


Solution: 


7 it 
AS = nCyIn(#) +nCyin (Ft) 
Exercise: 
Problem: 
A Carnot engine has an efficiency of 0.60. When the temperature of its 
cold reservoir changes, the efficiency drops to 0.55. If initially 


T., = 27 °C, determine (a) the constant value of 7}, and (b) the final 
value of Ty. 


Exercise: 


Problem: 


A Carnot engine performs 100 J of work while discharging 200 J of 
heat each cycle. After the temperature of the hot reservoir only is 
adjusted, it is found that the engine now does 130 J of work while 
discarding the same quantity of heat. (a) What are the initial and final 
efficiencies of the engine? (b) What is the fractional change in the 
temperature of the hot reservoir? 


Solution: 


a 033,0:;39"b:; 0:91 
Exercise: 


Problem: 


A Carnot refrigerator exhausts heat to the air, which is at a temperature 
of 25 “C. How much power is used by the refrigerator if it freezes 1.5 
g of water per second? Assume the water is at 0 °C. 


Additional Problems 


Exercise: 


Problem: 


A 300-W heat pump operates between the ground, whose temperature 
is 0 °C, and the interior of a house at 22 “C. What is the maximum 
amount of heat per hour that the heat pump can supply to the house? 


Solution: 


1.45 x 10’J 


Exercise: 


Problem: 


An engineer must design a refrigerator that does 300 J of work per 
cycle to extract 2100 J of heat per cycle from a freezer whose 
temperature is —10 °C. What is the maximum air temperature for 
which this condition can be met? Is this a reasonable condition to 
impose on the design? 


Exercise: 


Problem: 


A Carnot engine employs 1.5 mol of nitrogen gas as a working 
substance, which is considered as an ideal diatomic gas with y = 7.5 
at the working temperatures of the engine. The Carnot cycle goes in 
the cycle ABCDA with AB being an isothermal expansion. The volume 
at points A and C of the cycle are 5.0 x 10° m? and 0.15 L, 
respectively. The engine operates between two thermal baths of 
temperature 500 K and 300 K. (a) Find the values of volume at B and 
D. (b) How much heat is absorbed by the gas in the AB isothermal 
expansion? (c) How much work is done by the gas in the AB 
isothermal expansion? (d) How much heat is given up by the gas in the 
CD isothermal expansion? (e) How much work is done by the gas in 
the CD isothermal compression? (f) How much work is done by the 
gas in the BC adiabatic expansion? (g) How much work is done by the 
gas in the DA adiabatic compression? (h) Find the value of efficiency 
of the engine based on the net work and heat input. Compare this value 
to the efficiency of a Carnot engine based on the temperatures of the 
two baths. 


Solution: 


a. Vg = 0.042 m®, Vp = 0.018 m?°; b. 13,000 J; c. 13,000 J; d. — 
8,000 J; e. -8,000 J; f. 6200 J; g. -6200 J; h. 39%; with temperatures 
efficiency is 40%, which is off likely by rounding errors. 


Exercise: 


Problem: 


A 5.0-kg wood block starts with an initial speed of 8.0 m/s and slides 
across the floor until friction stops it. Estimate the resulting change in 
entropy of the universe. Assume that everything stays at a room 
temperature of 20 °C. 


Exercise: 
Problem: 
A system consisting of 20.0 mol of a monoatomic ideal gas is cooled 


at constant pressure from a volume of 50.0 L to 10.0 L. The initial 
temperature was 300 K. What is the change in entropy of the gas? 


Solution: 


—670 J/K 
Exercise: 
Problem: 
A glass beaker of mass 400 g contains 500 g of water at 27 “C. The 
beaker is heated reversibly so that the temperature of the beaker and 


water rise gradually to 57 °C. Find the change in entropy of the beaker 
and water together. 


Exercise: 


Problem: 


A Carnot engine operates between 550 °C and 20 °C baths and 
produces 300 kJ of energy in each cycle. Find the change in entropy of 
the (a) hot bath and (b) cold bath, in each Carnot cycle? 


Solution: 


a. -570 J/K; b. 570 J/K 


Exercise: 


Problem: 


An ideal gas at temperature T is stored in the left half of an insulating 
container of volume V using a partition of negligible volume (see 
below). What is the entropy change per mole of the gas in each of the 
following cases? (a) The partition is suddenly removed and the gas 
quickly fills the entire container. (b) A tiny hole is punctured in the 
partition and after a long period, the gas reaches an equilibrium state 
such that there is no net flow through the hole. (c) The partition is 
moved very slowly and adiabatically all the way to the right wall so 
that the gas finally fills the entire container. 


Exercise: 


Problem: 


A 0.50-kg piece of aluminum at 250 °C is dropped into 1.0 kg of 
water at 20 “C. After equilibrium is reached, what is the net entropy 
change of the system? 


Solution: 


82 J/K 


Exercise: 


Problem: 


Suppose 20 g of ice at 0 “C is added to 300 g of water at 60 “C. What 
is the total change in entropy of the mixture after it reaches thermal 
equilibrium? 

Exercise: 
Problem: 
A heat engine operates between two temperatures such that the 
working substance of the engine absorbs 5000 J of heat from the high- 
temperature bath and discharges 3000 J to the low-temperature bath. 
The rest of the energy is converted into mechanical energy of the 


turbine. Find (a) the amount of work produced by the engine and (b) 
the efficiency of the engine. 


Solution: 


a. 2000 J; b. 40% 
Exercise: 
Problem: 
A thermal engine produces 4 MJ of electrical energy while operating 
between two thermal baths of different temperatures. The working 


substance of the engine discharges 5 MJ of heat to the cold 
temperature bath. What is the efficiency of the engine? 


Exercise: 


Problem: 

A coal power plant consumes 100,000 kg of coal per hour and 
produces 500 MW of power. If the heat of combustion of coal is 30 
MJ/kg, what is the efficiency of the power plant? 


Solution: 


60% 


Exercise: 
Problem: 
A Carnot engine operates in a Carnot cycle between a heat source at 


550 °C and a heat sink at 20 °C. Find the efficiency of the Carnot 
engine. 


Exercise: 
Problem: 
A Carnot engine working between two heat baths of temperatures 600 


K and 273 K completes each cycle in 5 sec. In each cycle, the engine 
absorbs 10 kJ of heat. Find the power of the engine. 


Solution: 


64.4% 
Exercise: 
Problem: 
A Carnot cycle working between 100 °C and 30 °C is used to drive a 
refrigerator between —10 °C and 30 “C. How much energy must the 


Carnot engine produce per second so that the refrigerator is able to 
discard 10 J of energy per second? 


Challenge Problems 


Exercise: 


Problem: 


(a) An infinitesimal amount of heat is added reversibly to a system. By 
combining the first and second laws, show that dU = TdS — dW. (b) 
When heat is added to an ideal gas, its temperature and volume change 
from 7; and V; to JT and V2. Show that the entropy change of n 
moles of the gas is given by 


AS =n, nz +nkR ny. 


Solution: 


derive 
Exercise: 
Problem: 
Using the result of the preceding problem, show that for an ideal gas 
undergoing an adiabatic process, TV! is constant. 
Exercise: 
Problem: 


With the help of the two preceding problems, show that AS between 
states 1 and 2 of n moles an ideal gas is given by 


AS=nC; Inge —nR nz. 


Solution: 


derive 
Exercise: 
Problem: 
A cylinder contains 500 g of helium at 120 atm and 20 °C. The valve 
is leaky, and all the gas slowly escapes isothermally into the 


atmosphere. Use the results of the preceding problem to determine the 
resulting change in entropy of the universe. 


Exercise: 


Problem: 


A diatomic ideal gas is brought from an initial equilibrium state at 

p; = 0.50 atm and 7); = 300 K to a final stage with pp = 0.20 atm 
and T; = 500 K. Use the results of the previous problem to determine 
the entropy change per mole of the gas. 


Solution: 


18 J/K 
Exercise: 


Problem: 


The gasoline internal combustion engine operates in a cycle consisting 
of six parts. Four of these parts involve, among other things, friction, 
heat exchange through finite temperature differences, and accelerations 
of the piston; it is irreversible. Nevertheless, it is represented by the 
ideal reversible Otto cycle, which is illustrated below. The working 
substance of the cycle is assumed to be air. The six steps of the Otto 
cycle are as follows: 


i. Isobaric intake stroke (OA). A mixture of gasoline and air is 
drawn into the combustion chamber at atmospheric pressure pg as 
the piston expands, increasing the volume of the cylinder from 
zero to V4. 

. Adiabatic compression stroke (AB). The temperature of the 

mixture rises as the piston compresses it adiabatically from a 

volume Va to Vp. 

Ignition at constant volume (BC). The mixture is ignited by a 

spark. The combustion happens so fast that there is essentially no 

motion of the piston. During this process, the added heat Q 

causes the pressure to increase from pz to pc at the constant 

volume Vp(= Vc). 

iv. Adiabatic expansion (CD). The heated mixture of gasoline and air 

expands against the piston, increasing the volume from Vc to Vp. 


ee 


i 


ee 


iil. 


This is called the power stroke, as it is the part of the cycle that 

delivers most of the power to the crankshaft. 
v. Constant-volume exhaust (DA). When the exhaust valve opens, 
some of the combustion products escape. There is almost no 
movement of the piston during this part of the cycle, so the 
volume remains constant at V4(= Vp). Most of the available 
energy is lost here, as represented by the heat exhaust Qo. 
Isobaric compression (AO). The exhaust valve remains open, and 
the compression from V4, to zero drives out the remaining 
combustion products. 


(a) Using (i) e = W/Q,; (ii) W = Qi — Qo; and (iii) 
Q, = nC, (To — Tz), Qg = NC, (Tp — T 4), show that 


ee 


V1. 


Tp—-Ta 
Tco—-Tp* 


C1 


(b) Use the fact that steps (ii) and (iv) are adiabatic to show that 


1 


e = 1— ae. 
where r = V4/Vz. The quantity r is called the compression ratio of 
the engine. 


(c) In practice, r is kept less than around 7. For larger values, the 
gasoline-air mixture is compressed to temperatures so high that it 
explodes before the finely timed spark is delivered. This preignition 
causes engine knock and loss of power. Show that for r = 6 and 

+y = 1.4 (the value for air), e = 0.51, or an efficiency of 51%. 
Because of the many irreversible processes, an actual internal 
combustion engine has an efficiency much less than this ideal value. A 
typical efficiency for a tuned engine is about 25% to 30%. 


Exercise: 


Problem: 


An ideal diesel cycle is shown below. This cycle consists of five 
strokes. In this case, only air is drawn into the chamber during the 
intake stroke OA. The air is then compressed adiabatically from state A 
to state B, raising its temperature high enough so that when fuel is 
added during the power stroke BC, it ignites. After ignition ends at C, 
there is a further adiabatic power stroke CD. Finally, there is an 
exhaust at constant volume as the pressure drops from pp to py, 
followed by a further exhaust when the piston compresses the chamber 
volume to zero. 


(a) Use W = Qi _ Qo, Qi no nCallc —_ Tp); and 


Q, = nC, (Tp — T 4) to show that e = a = LTE 


(b) Use the fact that A — Band C' + D are adiabatic to show that 


(c) Since there is no preignition (remember, the chamber does not 
contain any fuel during the compression), the compression ratio can be 
larger than that for a gasoline engine. Typically, 

Va/Vp = 15 and Vp/Vo = 5. For these values and y = 1.4, show 
that e = 0.56, or an efficiency of 56%. Diesel engines actually operate 
at an efficiency of about 30% to 35% compared with 25% to 30% for 
gasoline engines. 


Solution: 


proof 


Exercise: 


Problem: 


Consider an ideal gas Joule cycle, also called the Brayton cycle, shown 
below. Find the formula for efficiency of the engine using this cycle in 
terms of P,, P2, and ¥. 


p 


Adiabat 


Adiabat 


Exercise: 
Problem: 
Derive a formula for the coefficient of performance of a refrigerator 


using an ideal gas as a working substance operating in the cycle shown 
below in terms of the properties of the three states labeled 1, 2, and 3. 


Adiabat 


Solution: 


—_ 3(p1—p2)Vi 
R ~ Sp2V3—3pi1Vi—paV1 


Exercise: 


Problem: 


Two moles of nitrogen gas, with - = 7/5 for ideal diatomic gases, 
occupies a volume of 10~*m? in an insulated cylinder at temperature 
300 K. The gas is adiabatically and reversibly compressed to a volume 
of 5 L. The piston of the cylinder is locked in its place, and the 
insulation around the cylinder is removed. The heat-conducting 
cylinder is then placed in a 300-K bath. Heat from the compressed gas 
leaves the gas, and the temperature of the gas becomes 300 K again. 
The gas is then slowly expanded at the fixed temperature 300 K until 
the volume of the gas becomes 10° 2m?, thus making a complete cycle 
for the gas. For the entire cycle, calculate (a) the work done by the gas, 
(b) the heat into or out of the gas, (c) the change in the internal energy 
of the gas, and (d) the change in entropy of the gas. 


Exercise: 


Problem: 


A Carnot refrigerator, working between 0 °C and 30 °C is used to 
cool a bucket of water containing 10~? m? of water at 30 °C to 5 °C 
in 2 hours. Find the total amount of work needed. 


Solution: 


W = 110,000 J 


Glossary 


disorder 
measure of order in a system; the greater the disorder is, the higher the 
entropy 


isentropic 
reversible adiabatic process where the process is frictionless and no 
heat is transferred 


third law of thermodynamics 
absolute zero temperature cannot be reached through any finite number 
of cooling steps 


Introduction 
class="introduction" 


Special 
relativity 
explains 
how time 

passes 

slightly 
differently 
on Earth 
and within 
the rapidly 
moving 
global 
positioning 
satellite 
(GPS). GPS 
units in 
vehicles 
could not 
find their 
correct 
location on 
Earth 
without 
taking this 
correction 
into 
account. 
(credit: 
modificatio 
n of work 
by U.S. Air 
Force) 


The special theory of relativity was proposed in 1905 by Albert Einstein 
(1879-1955). It describes how time, space, and physical phenomena appear 
in different frames of reference that are moving at constant velocity with 
respect to each other. This differs from Einstein’s later work on general 
relativity, which deals with any frame of reference, including accelerated 
frames. 


The theory of relativity led to a profound change in the way we perceive 
space and time. The “common sense” rules that we use to relate space and 
time measurements in the Newtonian worldview differ seriously from the 
correct rules at speeds near the speed of light. For example, the special 
theory of relativity tells us that measurements of length and time intervals 
are not the same in reference frames moving relative to one another. A 
particle might be observed to have a lifetime of 1.0 x 10-°sinone 
reference frame, but a lifetime of 2.0 x 10~°s in another; and an object 
might be measured to be 2.0 m long in one frame and 3.0 m long in another 
frame. These effects are usually significant only at speeds comparable to the 
speed of light, but even at the much lower speeds of the global positioning 
satellite, which requires extremely accurate time measurements to function, 
the different lengths of the same distance in different frames of reference 
are significant enough that they need to be taken into account. 


Unlike Newtonian mechanics, which describes the motion of particles, or 
Maxwell's equations, which specify how the electromagnetic field behaves, 


special relativity is not restricted to a particular type of phenomenon. 
Instead, its rules on space and time affect all fundamental physical theories. 


The modifications of Newtonian mechanics in special relativity do not 
invalidate classical Newtonian mechanics or require its replacement. 
Instead, the equations of relativistic mechanics differ meaningfully from 
those of classical Newtonian mechanics only for objects moving at 
relativistic speeds (i.e., speeds less than, but comparable to, the speed of 
light). In the macroscopic world that you encounter in your daily life, the 
relativistic equations reduce to classical equations, and the predictions of 
classical Newtonian mechanics agree closely enough with experimental 
results to disregard relativistic corrections. 


Invariance of Physical Laws 
By the end of this section, you will be able to: 


e Describe the theoretical and experimental issues that Einstein’s theory 
of special relativity addressed. 
e State the two postulates of the special theory of relativity. 


Suppose you calculate the hypotenuse of a right triangle given the base 
angles and adjacent sides. Whether you calculate the hypotenuse from one 
of the sides and the cosine of the base angle, or from the Pythagorean 
theorem, the results should agree. Predictions based on different principles 
of physics must also agree, whether we consider them principles of 
mechanics or principles of electromagnetism. 


Albert Einstein pondered a disagreement between predictions based on 
electromagnetism and on assumptions made in classical mechanics. 
Specifically, suppose an observer measures the velocity of a light pulse in 
the observer’s own rest frame; that is, in the frame of reference in which 
the observer is at rest. According to the assumptions long considered 
obvious in classical mechanics, if an observer measures a velocity V in one 
frame of reference, and that frame of reference is moving with velocity u 
past a second reference frame, an observer in the second frame measures 


the original velocity as v’ = ¥ + U. This sum of velocities is often referred 
to as Galilean relativity. If this principle is correct, the pulse of light that 
the observer measures as traveling with speed c travels at speed c + u 
measured in the frame of the second observer. If we reasonably assume that 
the laws of electrodynamics are the same in both frames of reference, then 
the predicted speed of light (in vacuum) in both frames should be 

c = 1/,/eop0. Each observer should measure the same speed of the light 
pulse with respect to that observer’s own rest frame. To reconcile 
difficulties of this kind, Einstein constructed his special theory of 
relativity, which introduced radical new ideas about time and space that 
have since been confirmed experimentally. 


Inertial Frames 


All velocities are measured relative to some frame of reference. For 
example, a car’s motion is measured relative to its starting position on the 
road it travels on; a projectile’s motion is measured relative to the surface 
from which it is launched; and a planet’s orbital motion is measured relative 
to the star it orbits. The frames of reference in which mechanics takes the 
simplest form are those that are not accelerating. Newton’s first law, the law 
of inertia, holds exactly in such a frame. 


Note: 

Inertial Reference Frame 

An inertial frame of reference is a reference frame in which a body at rest 
remains at rest and a body in motion moves at a constant speed in a straight 
line unless acted upon by an outside force. 


For example, to a passenger inside a plane flying at constant speed and 
constant altitude, physics seems to work exactly the same as when the 
passenger is standing on the surface of Earth. When the plane is taking off, 
however, matters are somewhat more complicated. In this case, the 
passenger at rest inside the plane concludes that a net force F' on an object is 
not equal to the product of mass and acceleration, ma. Instead, F is equal to 
ma plus a fictitious force. This situation is not as simple as in an inertial 
frame. The term “special” in “special relativity” refers to dealing only with 
inertial frames of reference. Einstein’s later theory of general relativity 
deals with all kinds of reference frames, including accelerating, and 
therefore non-inertial, reference frames. 


Einstein’s First Postulate 


Not only are the principles of classical mechanics simplest in inertial 
frames, but they are the same in all inertial frames. Einstein based the first 
postulate of his theory on the idea that this is true for all the laws of 
physics, not merely those in mechanics. 


Note: 
First Postulate of Special Relativity 
The laws of physics are the same in all inertial frames of reference. 


This postulate denies the existence of a special or preferred inertial frame. 
The laws of nature do not give us a way to endow any one inertial frame 
with special properties. For example, we cannot identify any inertial frame 
as being in a state of “absolute rest.” We can only determine the relative 
motion of one frame with respect to another. 


There is, however, more to this postulate than meets the eye. The laws of 
physics include only those that satisfy this postulate. We will see that the 
definitions of energy and momentum must be altered to fit this postulate. 
Another outcome of this postulate is the famous equation E = mc?, which 
relates energy to mass. 


Einstein’s Second Postulate 


The second postulate upon which Einstein based his theory of special 
relativity deals with the speed of light. Late in the nineteenth century, the 
major tenets of classical physics were well established. Two of the most 
important were the laws of electromagnetism and Newton’s laws. 
Investigations such as Young’s double-slit experiment in the early 1800s 
had convincingly demonstrated that light is a wave. Maxwell’s equations of 
electromagnetism implied that electromagnetic waves travel at 

c = 3.00 x 10°m /s in a vacuum, but they do not specify the frame of 
reference in which light has this speed. Many types of waves were known, 
and all travelled in some medium. Scientists therefore assumed that some 
medium catried the light, even in a vacuum, and that light travels at a speed 
c relative to that medium (often called “the aether”). 


Starting in the mid-1880s, the American physicist A.A. Michelson, later 
aided by E.W. Morley, made a series of direct measurements of the speed of 
light. They intended to deduce from their data the speed v at which Earth 
was moving through the mysterious medium for light waves. The speed of 


light measured on Earth should have been c +. v when Earth’s motion was 
opposite to the medium’s flow at speed u past the Earth, and c — v when 
Earth was moving in the same direction as the medium. The results of their 
measurements were Startling. 


Note: 

Michelson-Morley Experiment 

The Michelson-Morley experiment demonstrated that the speed of light 
in a vacuum is independent of the motion of Earth about the Sun. 


The eventual conclusion derived from this result is that light, unlike 
mechanical waves such as sound, does not need a medium to carry it. 
Furthermore, the Michelson-Morley results implied that the speed of light c 
is independent of the motion of the source relative to the observer. That is, 
everyone observes light to move at speed c regardless of how they move 
relative to the light source or to one another. For several years, many 
scientists tried unsuccessfully to explain these results within the framework 
of Newton’s laws. 


In addition, there was a contradiction between the principles of 
electromagnetism and the assumption made in Newton’s laws about relative 
velocity. Classically, the velocity of an object in one frame of reference and 
the velocity of that object in a second frame of reference relative to the first 
should combine like simple vectors to give the velocity seen in the second 
frame. If that were correct, then two observers moving at different speeds 
would see light traveling at different speeds. Imagine what a light wave 
would look like to a person traveling along with it (in vacuum) at a speed c. 
If such a motion were possible, then the wave would be stationary relative 
to the observer. It would have electric and magnetic fields whose strengths 
varied with position but were constant in time. This is not allowed by 
Maxwell’s equations. So either Maxwell’s equations are different in 
different inertial frames, or an object with mass cannot travel at speed c. 
Einstein concluded that the latter is true: An object with mass cannot travel 


at speed c. Maxwell’s equations are correct, but Newton’s addition of 
velocities is not correct for light. 


Not until 1905, when Einstein published his first paper on special relativity, 
was the currently accepted conclusion reached. Based mostly on his 
analysis that the laws of electricity and magnetism would not allow another 
speed for light, and only slightly aware of the Michelson-Morley 
experiment, Einstein detailed his second postulate of special relativity. 


Note: 

Second Postulate of Special Relativity 

Light travels in a vacuum with the same speed c in any direction in all 
inertial frames. 


In other words, the speed of light has the same definite speed for any 
observer, regardless of the relative motion of the source. This deceptively 
simple and counterintuitive postulate, along with the first postulate, leave 
all else open for change. Among the changes are the loss of agreement on 
the time between events, the variation of distance with speed, and the 
realization that matter and energy can be converted into one another. We 
describe these concepts in the following sections. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Explain how special relativity differs 
from general relativity. 


Solution: 


Special relativity applies only to objects moving at constant velocity, 
whereas general relativity applies to objects that undergo acceleration. 


Summary 


e Relativity is the study of how observers in different reference frames 
measure the same event. 

¢ Modern relativity is divided into two parts. Special relativity deals 
with observers in uniform (unaccelerated) motion, whereas general 
relativity includes accelerated relative motion and gravity. Modern 
relativity is consistent with all empirical evidence thus far and, in the 
limit of low velocity and weak gravitation, gives close agreement with 
the predictions of classical (Galilean) relativity. 

e An inertial frame of reference is a reference frame in which a body at 
rest remains at rest and a body in motion moves at a constant speed in 
a straight line unless acted upon by an outside force. 

¢ Modern relativity is based on Ejinstein’s two postulates. The first 
postulate of special relativity is that the laws of physics are the same in 
all inertial frames of reference. The second postulate of special 
relativity is that the speed of light c is the same in all inertial frames of 
reference, independent of the relative motion of the observer and the 
light source. 

e The Michelson-Morley experiment demonstrated that the speed of 
light in a vacuum is independent of the motion of Earth about the sun. 


Conceptual Questions 


Exercise: 


Problem: 


Which of Einstein’s postulates of special relativity includes a concept 
that does not fit with the ideas of classical physics? Explain. 


Solution: 


the second postulate, involving the speed of light; classical physics 
already included the idea that the laws of mechanics, at least, were the 
same in all inertial frames, but the velocity of a light pulse was 
different in different frames moving with respect to each other 


Exercise: 


Problem: 


Is Earth an inertial frame of reference? Is the sun? Justify your 
response. 


Exercise: 


Problem: 


When you are flying in a commercial jet, it may appear to you that the 
airplane is stationary and Earth is moving beneath you. Is this point of 
view valid? Discuss briefly. 


Solution: 


yes, provided the plane is flying at constant velocity relative to the 
Earth; in that case, an object with no force acting on it within the plane 
has no change in velocity relative to the plane and no change in 
velocity relative to the Earth; both the plane and the ground are inertial 
frames for describing the motion of the object 


Glossary 


first postulate of special relativity 
laws of physics are the same in all inertial frames of reference 


Galilean relativity 
if an observer measures a velocity in one frame of reference, and that 
frame of reference is moving with a velocity past a second reference 
frame, an observer in the second frame measures the original velocity 
as the vector sum of these velocities 


inertial frame of reference 
reference frame in which a body at rest remains at rest and a body in 
motion moves at a constant speed in a straight line unless acted on by 
an outside force 


Michelson-Morley experiment 
investigation performed in 1887 that showed that the speed of light in 
a vacuum is the same in all frames of reference from which it is 
viewed 


rest frame 
frame of reference in which the observer is at rest 


second postulate of special relativity 
light travels in a vacuum with the same speed c in any direction in all 
inertial frames 


special theory of relativity 
theory that Albert Einstein proposed in 1905 that assumes all the laws 
of physics have the same form in every inertial frame of reference, and 
that the speed of light is the same within all inertial frames 


Relativity of Simultaneity 
By the end of this section, you will be able to: 


e Show from Einstein's postulates that two events measured as 
simultaneous in one inertial frame are not necessarily simultaneous in 
all inertial frames. 

e Describe how simultaneity is a relative concept for observers in 
different inertial frames in relative motion. 


Do time intervals depend on who observes them? Intuitively, it seems that 
the time for a process, such as the elapsed time for a foot race ([link]), 
should be the same for all observers. In everyday experiences, 
disagreements over elapsed time have to do with the accuracy of measuring 
time. No one would be likely to argue that the actual time interval was 
different for the moving runner and for the stationary clock displayed. 
Carefully considering just how time is measured, however, shows that 
elapsed time does depends on the relative motion of an observer with 
respect to the process being measured. 


Elapsed time for a foot race is the same for all observers, but at 


relativistic speeds, elapsed time depends on the motion of the observer 
relative to the location where the process being timed occurs. (credit: 
"Jason Edward Scott Bain"/Flickr) 


Consider how we measure elapsed time. If we use a stopwatch, for 
example, how do we know when to start and stop the watch? One method is 
to use the arrival of light from the event. For example, if you’re in a moving 
car and observe the light arriving from a traffic signal change from green to 
red, you know it’s time to step on the brake pedal. The timing is more 
accurate if some sort of electronic detection is used, avoiding human 
reaction times and other complications. 


Now suppose two observers use this method to measure the time interval 
between two flashes of light from flash lamps that are a distance apart 
({link]). An observer A is seated midway on a rail car with two flash lamps 
at opposite sides equidistant from her. A pulse of light is emitted from each 
flash lamp and moves toward observer A, shown in frame (a) of the figure. 
The rail car is moving rapidly in the direction indicated by the velocity 
vector in the diagram. An observer B standing on the platform is facing the 
rail car as it passes and observes both flashes of light reach him 
simultaneously, as shown in frame (c). He measures the distances from 
where he saw the pulses originate, finds them equal, and concludes that the 
pulses were emitted simultaneously. 


However, because of Observer A’s motion, the pulse from the right of the 
railcar, from the direction the car is moving, reaches her before the pulse 
from the left, as shown in frame (b). She also measures the distances from 
within her frame of reference, finds them equal, and concludes that the 
pulses were not emitted simultaneously. 


The two observers reach conflicting conclusions about whether the two 
events at well-separated locations were simultaneous. Both frames of 
reference are valid, and both conclusions are valid. Whether two events at 
separate locations are simultaneous depends on the motion of the observer 
relative to the locations of the events. 


observer B 
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observer B 


(Cc) 


(a) Two pulses of light are emitted simultaneously relative to observer 
B. (c) The pulses reach observer B’s position simultaneously. (b) 
Because of A’s motion, she sees the pulse from the right first and 

concludes the bulbs did not flash simultaneously. Both conclusions are 

correct. 


Here, the relative velocity between observers affects whether two events a 
distance apart are observed to be simultaneous. Simultaneity is not absolute. 
We might have guessed (incorrectly) that if light is emitted simultaneously, 
then two observers halfway between the sources would see the flashes 
simultaneously. But careful analysis shows this cannot be the case if the 
speed of light is the same in all inertial frames. 


This type of thought experiment (in German, “Gedankenexperiment”) 
shows that seemingly obvious conclusions must be changed to agree with 
the postulates of relativity. The validity of thought experiments can only be 
determined by actual observation, and careful experiments have repeatedly 
confirmed Einstein’s theory of relativity. 


Summary 


e Two events are defined to be simultaneous if an observer measures 
them as occurring at the same time (such as by receiving light from the 
events). 

¢ Two events at locations a distance apart that are simultaneous for an 
observer at rest in one frame of reference are not necessarily 
simultaneous for an observer at rest in a different frame of reference. 


Time Dilation 
By the end of this section, you will be able to: 


e Explain how time intervals can be measured differently in different 
reference frames. 

¢ Describe how to distinguish a proper time interval from a dilated time 
interval. 

e Describe the significance of the muon experiment. 

e Explain why the twin paradox is not a contradiction. 

¢ Calculate time dilation given the speed of an object in a given frame. 


The analysis of simultaneity shows that Einstein’s postulates imply an 
important effect: Time intervals have different values when measured in 
different inertial frames. Suppose, for example, an astronaut measures the 
time it takes for a pulse of light to travel a distance perpendicular to the 
direction of his ship’s motion (relative to an earthbound observer), bounce 
off a mirror, and return ([link]). How does the elapsed time that the 
astronaut measures in the spacecraft compare with the elapsed time that an 
earthbound observer measures by observing what is happening in the 
spacecraft? 


Examining this question leads to a profound result. The elapsed time for a 
process depends on which observer is measuring it. In this case, the time 
measured by the astronaut (within the spaceship where the astronaut is at 
rest) is smaller than the time measured by the earthbound observer (to 
whom the astronaut is moving). The time elapsed for the same process is 
different for the observers, because the distance the light pulse travels in the 
astronaut’s frame is smaller than in the earthbound frame, as seen in [link]. 
Light travels at the same speed in each frame, so it takes more time to travel 
the greater distance in the earthbound frame. 


Mirror 


| : 
Beginning Ending 
Receiver event EQ At event 


Light 
source Observer __ 
é on Earth 


(a) (b) 


(c) 


(a) An astronaut measures the time Av for light to travel distance 2D 
in the astronaut’s frame. (b) A NASA scientist on Earth sees the light 
follow the longer path 2s and take a longer time At. (c) These 
triangles are used to find the relationship between the two distances D 
and s. 


Note: 
Time Dilation 


Time dilation is the lengthening of the time interval between two events 
for an observer in an inertial frame that is moving with respect to the rest 
frame of the events (in which the events occur at the same location). 


To quantitatively compare the time measurements in the two inertial frames, 
we can relate the distances in [link] to each other, then express each 
distance in terms of the time of travel (respectively either At or Ar) of the 
pulse in the corresponding reference frame. The resulting equation can then 
be solved for At in terms of Ar. 


The lengths D and L in [link] are the sides of a right triangle with 
hypotenuse s. From the Pythagorean theorem, 
Equation: 


s = D+ 1. 


The lengths 2s and 2L are, respectively, the distances that the pulse of light 
and the spacecraft travel in time At in the earthbound observer’s frame. 
The length D is the distance that the light pulse travels in time Az in the 
astronaut’s frame. This gives us three equations: 

Equation: 


28 = CAt 2 = vA 2D = ch7. 


Note that we used Einstein’s second postulate by taking the speed of light to 
be c in both inertial frames. We substitute these results into the previous 
expression from the Pythagorean theorem: 

Equation: 
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Then we rearrange to obtain 


Equation: 


(cAt)? — (vAt)? = (cAr)?. 


Finally, solving for At in terms of Av gives us 


Note: 
Equation: 
A 
SS 
= (ley 

This is equivalent to 

Equation: 

At = yAr, 


where 7 is the relativistic factor (often called the Lorentz factor) given by 


Note: 
Equation: 


and v and c are the speeds of the moving observer and light, respectively. 


Note the asymmetry between the two measurements. Only one of them is a 
measurement of the time interval between two events—the emission and 
arrival of the light pulse—at the same position. It is a measurement of the 
time interval in the rest frame of a single clock. The measurement in the 
earthbound frame involves comparing the time interval between two events 
that occur at different locations. The time interval between events that occur 
at a single location has a separate name to distinguish it from the time 
measured by the earthbound observer, and we use the separate symbol Ar 
to refer to it throughout this chapter. 


Note: 

Proper Time 

The proper time interval Av between two events is the time interval 
measured by an observer for whom both events occur at the same location. 


The equation relating At and Av is truly remarkable. First, as stated earlier, 
elapsed time is not the same for different observers moving relative to one 
another, even though both are in inertial frames. A proper time interval Ar 
for an observer who, like the astronaut, is moving with the apparatus, is 
smaller than the time interval for other observers. It is the smallest possible 
measured time between two events. The earthbound observer sees time 
intervals within the moving system as dilated (i.e., lengthened) relative to 
how the observer moving relative to Earth sees them within the moving 
system. Alternatively, according to the earthbound observer, less time 
passes between events within the moving frame. Note that the shortest 
elapsed time between events is in the inertial frame in which the observer 
sees the events (e.g., the emission and arrival of the light signal) occur at 
the same point. 


This time effect is real and is not caused by inaccurate clocks or improper 
measurements. Time-interval measurements of the same event differ for 
observers in relative motion. The dilation of time is an intrinsic property of 


time itself. All clocks moving relative to an observer, including biological 
clocks, such as a person’s heartbeat, or aging, are observed to run more 
slowly compared with a clock that is stationary relative to the observer. 


Note that if the relative velocity is much less than the speed of light 
(v<<c), then v*/c? is extremely small, and the elapsed times At and Ar 
are nearly equal. At low velocities, physics based on modern relativity 
approaches classical physics—everyday experiences involve very small 
relativistic effects. However, for speeds near the speed of light, v” j c” is 


close to one, so 1/1 — v?/c? is very small and At becomes significantly 
larger than Ar. 


Half-Life of a Muon 


There is considerable experimental evidence that the equation At = yA is 
correct. One example is found in cosmic ray particles that continuously rain 
down on Earth from deep space. Some collisions of these particles with 
nuclei in the upper atmosphere result in short-lived particles called muons. 
The half-life (amount of time for half of a material to decay) of a muon is 
1.52 us when it is at rest relative to the observer who measures the half-life. 
This is the proper time interval Ar. This short time allows very few muons 
to reach Earth’s surface and be detected if Newtonian assumptions about 
time and space were correct. However, muons produced by cosmic ray 
particles have a range of velocities, with some moving near the speed of 
light. It has been found that the muon’s half-life as measured by an 
earthbound observer (At) varies with velocity exactly as predicted by the 
equation At = yAr. The faster the muon moves, the longer it lives. We on 
Earth see the muon last much longer than its half-life predicts within its 
own rest frame. As viewed from our frame, the muon decays more slowly 
than it does when at rest relative to us. A far larger fraction of muons reach 
the ground as a result. 


Before we present the first example of solving a problem in relativity, we 
State a strategy you can use as a guideline for these calculations. 


Note: 
Problem-Solving Strategy: Relativity 


1. Make a list of what is given or can be inferred from the problem as 
stated (identify the knowns). Look in particular for information on 
relative velocity v. 

2. Identify exactly what needs to be determined in the problem (identify 
the unknowns). 

3. Make certain you understand the conceptual aspects of the problem 
before making any calculations (express the answer as an equation). 
Decide, for example, which observer sees time dilated or length 
contracted before working with the equations or using them to carry 
out the calculation. If you have thought about who sees what, who is 
moving with the event being observed, who sees proper time, and so 
on, you will find it much easier to determine if your calculation is 
reasonable. 

4. Determine the primary type of calculation to be done to find the 
unknowns identified above (do the calculation). You will find the 
section summary helpful in determining whether a length contraction, 
relativistic kinetic energy, or some other concept is involved. 


Note that you should not round off during the calculation. As noted in the 
text, you must often perform your calculations to many digits to see the 
desired effect. You may round off at the very end of the problem solution, 
but do not use a rounded number in a subsequent calculation. Also, check 
the answer to see if it is reasonable: Does it make sense? This may be more 
difficult for relativity, which has few everyday examples to provide 
experience with what is reasonable. But you can look for velocities greater 
than c or relativistic effects that are in the wrong direction (such as a time 
contraction where a dilation was expected). 


Example: 
Time Dilation in a High-Speed Vehicle 


The Hypersonic Technology Vehicle 2 (HTV-2) is an experimental rocket 
vehicle capable of traveling at 21,000 km/h (5830 m/s). If an electronic 
clock in the HTV-2 measures a time interval of exactly 1-s duration, what 
would observers on Earth measure the time interval to be? 

Strategy 

Apply the time dilation formula to relate the proper time interval of the 
signal in HTV-2 to the time interval measured on the ground. 

Solution 


a. Identify the knowns: Ar = 1s; v = 5830 m/s. 
b. Identify the unknown: At. 


c. Express the answer as an equation: 
Equation: 


TN an a 


d. Do the calculation. Use the expression for y to determine At from Ar 


Equation: 


At =———=—_—. 
5830 m/s 
i ( 3.00 x ao ) 
— 1.000000000189 s 
= 1s 180 slo s: 
Significance 


The very high speed of the HTV-2 is still only 10° times the speed of light. 
Relativistic effects for the HTV-2 are negligible for almost all purposes, 
but are not zero. 


Example: 
What Speeds are Relativistic? 


How fast must a vehicle travel for 1 second of time measured on a 
passenger’s watch in the vehicle to differ by 1% for an observer measuring 
it from the ground outside? 

Strategy 

Use the time dilation formula to find v/c for the given ratio of times. 
Solution 


a. Identify the known: 
Equation: 


AT 1, 


At 1.01 


b. Identify the unknown: v/c. 
c. Express the answer as an equation: 


Equation: 
A 
At = VJ1-v2/e2 
Ar\2 _ v2 
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d. Do the calculation: 
Equation: 


© |e 


= /1 Say sels 
= 0.14. 


Significance 

The result shows that an object must travel at very roughly 10% of the 
speed of light for its motion to produce significant relativistic time dilation 
effects. 


Example: 
Calculating At for a Relativistic Event 
Suppose a cosmic ray colliding with a nucleus in Earth’s upper atmosphere 
produces a muon that has a velocity v = 0.950c. The muon then travels at 
constant velocity and lives 2.20 pts as measured in the muon’s frame of 
reference. (You can imagine this as the muon’s internal clock.) How long 
does the muon live as measured by an earthbound observer ([link])? 
Ar At 
Elapsed muon Elapsed muon 
lifetime lifetime 


Muon created 


Muon created 


Muon decays Muon decays 


(a) Muon’s reference frame (b) Earth's reference frame 


A muon in Earth’s atmosphere lives longer as measured by an 
earthbound observer than as measured by the muon’s internal clock. 


As we will discuss later, in the muon’s reference frame, it travels a shorter 
distance than measured in Earth’s reference frame. 

Strategy 

A clock moving with the muon measures the proper time of its decay 
process, so the time we are given is Av = 2.20us. The earthbound 
observer measures At as given by the equation At = yA. Because the 
velocity is given, we can calculate the time in Earth’s frame of reference. 


Solution 


a. Identify the knowns: v = 0.950c, Ar = 2.20us. 
b. Identify the unknown: At. 
c. Express the answer as an equation. Use: 


Equation: 
RES VAT 
with 
Equation: 
if 
ay, = 
y2 


d. Do the calculation. Use the expression for y to determine At from Ar 


Equation: 


2.20us 


J 1—(0.950)? 
=) (ois: 


Remember to keep extra significant figures until the final answer. 


Significance 

One implication of this example is that because y = 3.20 at 95.0% of the 
speed of light (v = 0.950c), the relativistic effects are significant. The two 
time intervals differ by a factor of 3.20, when classically they would be the 
same. Something moving at 0.950c is said to be highly relativistic. 


Example: 

Relativistic Television 

A non-flat screen, older-style television display ((link]) works by 
accelerating electrons over a short distance to relativistic speed, and then 
using electromagnetic fields to control where the electron beam strikes a 
fluorescent layer at the front of the tube. Suppose the electrons travel at 
6.00 x 10’ m/s through a distance of 0.200 m from the start of the beam 
to the screen. (a) What is the time of travel of an electron in the rest frame 
of the television set? (b) What is the electron’s time of travel in its own rest 
frame? 


The electron beam in a cathode ray tube television display. 


Strategy for (a) 

(a) Calculate the time from vt = d. Even though the speed is relativistic, 
the calculation is entirely in one frame of reference, and relativity is 
therefore not involved. 

Solution 


a. Identify the knowns: 
Equation: 


v = 6.00 x 10’m/s;d = 0.200 m. 


b. Identify the unknown: the time of travel At. 
c. Express the answer as an equation: 


Equation: 
d 
At = —. 
7) 
d. Do the calculation: 
Equation: 
f= 0.200 m 
6.00 x 10’ m/s 
Sew ihe ee 
Significance 


The time of travel is extremely short, as expected. Because the calculation 
is entirely within a single frame of reference, relativity is not involved, 
even though the electron speed is close to c. 

Strategy for (b) 

(b) In the frame of reference of the electron, the vacuum tube is moving 
and the electron is stationary. The electron-emitting cathode leaves the 
electron and the front of the vacuum tube strikes the electron with the 
electron at the same location. Therefore we use the time dilation formula to 
relate the proper time in the electron rest frame to the time in the television 
frame. 

Solution 


a. Identify the knowns (from part a): 
Equation: 


Re i338) 10 677 1600 10 my sd 0-20 0a, 


b. Identify the unknown: T. 


c. Express the answer as an equation: 
Equation: 


= ae NG 


Ar = At/1—-v?/e. 


d. Do the calculation: 
Equation: 


6.00 x 10’m/s i 


= -9 
Ar = (3.33 x 10 s) 1— aaa 


= 3.26 x 107°s. 


Significance 

The time of travel is shorter in the electron frame of reference. Because the 
problem requires finding the time interval measured in different reference 
frames for the same process, relativity is involved. If we had tried to 
calculate the time in the electron rest frame by simply dividing the 0.200 m 
by the speed, the result would be slightly incorrect because of the 
relativistic speed of the electron. 


Note: 
Exercise: 


Problem: Check Your Understanding What is y if v = 0.650c? 


Solution: 


The Twin Paradox 


An intriguing consequence of time dilation is that a space traveler moving 
at a high velocity relative to Earth would age less than the astronaut’s 
earthbound twin. This is often known as the twin paradox. Imagine the 
astronaut moving at such a velocity that ~ = 30.0, as in [link]. A trip that 
takes 2.00 years in her frame would take 60.0 years in the earthbound twin’s 
frame. Suppose the astronaut travels 1.00 year to another star system, 
briefly explores the area, and then travels 1.00 year back. An astronaut who 
was 40 years old at the start of the trip would be would be 42 when the 
spaceship returns. Everything on Earth, however, would have aged 60.0 
years. The earthbound twin, if still alive, would be 100 years old. 


The situation would seem different to the astronaut in [link]. Because 
motion is relative, the spaceship would seem to be stationary and Earth 
would appear to move. (This is the sensation you have when flying in a jet.) 
Looking out the window of the spaceship, the astronaut would see time 
slow down on Earth by a factor of y = 30.0. Seen from the spaceship, the 
earthbound sibling will have aged only 2/30, or 0.07, of a year, whereas the 
astronaut would have aged 2.00 years. 


At start of trip, both twins are same age 


Ship travels at 
relativistic speed 


Q ¢" |_\ 
At end of trip, Earthbound twin 
has aged more than traveling twin 


The twin paradox consists of the conflicting 
conclusions about which twin ages more as a result of 
a long space journey at relativistic speed. 


The paradox here is that the two twins cannot both be correct. As with all 
paradoxes, conflicting conclusions come from a false premise. In fact, the 
astronaut’s motion is significantly different from that of the earthbound 
twin. The astronaut accelerates to a high velocity and then decelerates to 
view the star system. To return to Earth, she again accelerates and 
decelerates. The spacecraft is not in a single inertial frame to which the time 
dilation formula can be directly applied. That is, the astronaut twin changes 
inertial references. The earthbound twin does not experience these 


accelerations and remains in the same inertial frame. Thus, the situation is 
not symmetric, and it is incorrect to claim that the astronaut observes the 
same effects as her twin. The lack of symmetry between the twins will be 
still more evident when we analyze the journey later in this chapter in terms 
of the path the astronaut follows through four-dimensional space-time. 


In 1971, American physicists Joseph Hafele and Richard Keating verified 
time dilation at low relative velocities by flying extremely accurate atomic 
clocks around the world on commercial aircraft. They measured elapsed 
time to an accuracy of a few nanoseconds and compared it with the time 
measured by clocks left behind. Hafele and Keating’s results were within 
experimental uncertainties of the predictions of relativity. Both special and 
general relativity had to be taken into account, because gravity and 
accelerations were involved as well as relative motion. 


Note: 
Exercise: 


Problem: 
Check Your Understanding a. A particle travels at 1.90 x 10°m/s 


and lives 2.10 x 10 8s when at rest relative to an observer. How 
long does the particle live as viewed in the laboratory? 


Solution: 
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(3.00 x 108 m/s)? 


Exercise: 


Problem: 


b. Spacecraft A and B pass in opposite directions at a relative speed of 
4.00 x 10’m/s. An internal clock in spacecraft A causes it to emit a 
radio signal for 1.00 s. The computer in spacecraft B corrects for the 
beginning and end of the signal having traveled different distances, to 
calculate the time interval during which ship A was emitting the 
signal. What is the time interval that the computer in spacecraft B 
calculates? 


Solution: 


b. Only the relative speed of the two spacecraft matters because there 
is no absolute motion through space. The signal is emitted from a 
fixed location in the frame of reference of A, so the proper time 
interval of its emission is 7 = 1.00 s. The duration of the signal 
measured from frame of reference B is then 
22 = 
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(3.00 x 108 m/s)” 


Summary 


e Two events are defined to be simultaneous if an observer measures 
them as occurring at the same time. They are not necessarily 
simultaneous to all observers—simultaneity is not absolute. 

¢ Time dilation is the lengthening of the time interval between two 
events when seen in a moving inertial frame rather than the rest frame 
of the events (in which the events occur at the same location). 

e Observers moving at a relative velocity v do not measure the same 
elapsed time between two events. Proper time A7 is the time measured 
in the reference frame where the start and end of the time interval 
occur at the same location. The time interval At measured by an 
observer who sees the frame of events moving at speed v is related to 
the proper time interval Az of the events by the equation: 


Equation: 


where 
Equation: 


ih 


yi-3 


e The premise of the twin paradox is faulty because the traveling twin is 
accelerating. The journey is not symmetrical for the two twins. 

e Time dilation is usually negligible at low relative velocities, but it does 
occur, and it has been verified by experiment. 

e The proper time is the shortest measure of any time interval. Any 
observer who is moving relative to the system being observed 
measures a time interval longer than the proper time. 


2 


Conceptual Questions 


Exercise: 
Problem: 
(a) Does motion affect the rate of a clock as measured by an observer 


moving with it? (b) Does motion affect how an observer moving 
relative to a clock measures its rate? 


Exercise: 
Problem: 
To whom does the elapsed time for a process seem to be longer, an 


observer moving relative to the process or an observer moving with the 
process? Which observer measures the interval of proper time? 


Solution: 


The observer moving with the process sees its interval of proper time, 
which is the shortest seen by any observer. 


Exercise: 


Problem: 
(a) How could you travel far into the future of Earth without aging 
significantly? (b) Could this method also allow you to travel into the 
past? 

Problems 


Exercise: 


Problem: (a) What is y if v = 0.250c? (b) If v = 0.500c? 
Solution: 


ae 1.03283.b, 1,15 


Exercise: 


Problem: (a) What is yy if v = 0.100c? (b) If v = 0.900c? 
Exercise: 
Problem: 
Particles called 7-mesons are produced by accelerator beams. If these 
particles travel at 2.70 x 10°m/s and live 2.60 x 1078s when at 


rest relative to an observer, how long do they live as viewed in the 
laboratory? 


Solution: 


5.96 x 108s 


Exercise: 
Problem: 
Suppose a particle called a kaon is created by cosmic radiation striking 
the atmosphere. It moves by you at 0.980c, and it lives 1.24 x 10°8s 


when at rest relative to an observer. How long does it live as you 
observe it? 


Exercise: 
Problem: 
A neutral 7-meson is a particle that can be created by accelerator 
beams. If one such particle lives 1.40 x 10°'°s as measured in the 


laboratory, and 0.840 x 10°!°s when at rest relative to an observer, 
what is its velocity relative to the laboratory? 


Solution: 


0.800c 
Exercise: 


Problem: 


A neutron lives 900 s when at rest relative to an observer. How fast is 
the neutron moving relative to an observer who measures its life span 
to be 2065 s? 


Exercise: 


Problem: 


If relativistic effects are to be less than 1%, then y must be less than 
1.01. At what relative velocity is y = 1.01? 


Solution: 


0.140c 


Exercise: 


Problem: 


If relativistic effects are to be less than 3%, then -y must be less than 
1.03. At what relative velocity is y = 1.03? 


Glossary 


proper time 
AT is the time interval measured by an observer who sees the 
beginning and end of the process that the time interval measures occur 
at the same location 


time dilation 
lengthening of the time interval between two events when seen in a 
moving inertial frame rather than the rest frame of the events (in which 
the events occur at the same location) 


Length Contraction 
By the end of this section, you will be able to: 


e Explain how simultaneity and length contraction are related. 
e Describe the relation between length contraction and time dilation and 
use it to derive the length-contraction equation. 


The length of the train car in [link] is the same for all the passengers. All of 
them would agree on the simultaneous location of the two ends of the car 
and obtain the same result for the distance between them. But simultaneous 
events in one inertial frame need not be simultaneous in another. If the train 
could travel at relativistic speeds, an observer on the ground would see the 
simultaneous locations of the two endpoints of the car at a different distance 
apart than observers inside the car. Measured distances need not be the 
same for different observers when relativistic speeds are involved. 
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People might describe distances differently, but at 
relativistic speeds, the distances really are different. 
(credit: “russavia”/Flickr) 


Proper Length 


Two observers passing each other always see the same value of their 
relative speed. Even though time dilation implies that the train passenger 
and the observer standing alongside the tracks measure different times for 
the train to pass, they still agree that relative speed, which is distance 
divided by elapsed time, is the same. If an observer on the ground and one 
on the train measure a different time for the length of the train to pass the 
ground observer, agreeing on their relative speed means they must also see 
different distances traveled. 


The muon discussed in [link] illustrates this concept ([link]). To an observer 
on Earth, the muon travels at 0.950c for 7.05 us from the time it is produced 
until it decays. Therefore, it travels a distance relative to Earth of: 
Equation: 


Ly = vAt = (0.950)(3.00 x 10°m/s)(7.05 x 10°°s) = 2.01 km. 


In the muon frame, the lifetime of the muon is 2.20 us. In this frame of 
reference, the Earth, air, and ground have only enough time to travel: 
Equation: 


L = vAr = (0.950)(3.00 x 108 m/s)(2.20 x 10-°s) km = 0.627 km. 


The distance between the same two events (production and decay of a 
muon) depends on who measures it and how they are moving relative to it. 


Note: 

Proper Length 

Proper length Lo is the distance between two points measured by an 
observer who is at rest relative to both of the points. 


The earthbound observer measures the proper length Lo because the points 
at which the muon is produced and decays are stationary relative to Earth. 
To the muon, Earth, air, and clouds are moving, so the distance L it sees is 
not the proper length. 


2 ; i 


(a) (b) 


(a) The earthbound observer sees the muon travel 2.01 km. (b) The 
same path has length 0.627 km seen from the muon’s frame of 
reference. The Earth, air, and clouds are moving relative to the muon 
in its frame, and have smaller lengths along the direction of travel. 


Length Contraction 


To relate distances measured by different observers, note that the velocity 
relative to the earthbound observer in our muon example is given by 
Equation: 


Ly 
C= 
At 
The time relative to the earthbound observer is At, because the object being 
timed is moving relative to this observer. The velocity relative to the 


moving observer is given by 
Equation: 


The moving observer travels with the muon and therefore observes the 
proper time Avr. The two velocities are identical; thus, 
Equation: 


Lo L 


At Ar 


We know that At = yA. Substituting this equation into the relationship 
above gives 
Equation: 


Substituting for y gives an equation relating the distances measured by 
different observers. 


Note: 

Length Contraction 

Length contraction is the decrease in the measured length of an object 
from its proper length when measured in a reference frame that is moving 
with respect to the object: 

Equation: 


where Lg is the length of the object in its rest frame, and L is the length in 
the frame moving with velocity v. 


If we measure the length of anything moving relative to our frame, we find 
its length L to be smaller than the proper length Lg that would be measured 
if the object were stationary. For example, in the muon’s rest frame, the 
distance Earth moves between where the muon was produced and where it 
decayed is shorter than the distance traveled as seen from the Earth’s frame. 
Those points are fixed relative to Earth but are moving relative to the muon. 
Clouds and other objects are also contracted along the direction of motion 
as seen from muon’s rest frame. 


Thus, two observers measure different distances along their direction of 
relative motion, depending on which one is measuring distances between 
objects at rest. 


But what about distances measured in a direction perpendicular to the 
relative motion? Imagine two observers moving along their x-axes and 
passing each other while holding meter sticks vertically in the y-direction. 
[link] shows two meter sticks M and M/ that are at rest in the reference 
frames of two boys S and S/, respectively. A small paintbrush is attached to 
the top (the 100-cm mark) of stick M/. Suppose that S/ is moving to the 
right at a very high speed v relative to S, and the sticks are oriented so that 
they are perpendicular, or transverse, to their relative velocity vector. The 
sticks are held so that as they pass each other, their lower ends (the 0-cm 
marks) coincide. Assume that when S looks at his stick M afterwards, he 
finds a line painted on it, just below the top of the stick. Because the brush 
is attached to the top of the other boy’s stick M/, S can only conclude that 
stick Mr is less than 1.0 m long. 


100 cm 


Meter sticks M and M’ are stationary in the reference frames 
of observers S and S/, respectively. As the sticks pass, a 
small brush attached to the 100-cm mark of M/ paints a line 
on M. 


Now when the boys approach each other, S/, like S, sees a meter stick 
moving toward him with speed v. Because their situations are symmetric, 
each boy must make the same measurement of the stick in the other frame. 
So, if S measures stick M/ to be less than 1.0 m long, S/ must measure stick 
M to be also less than 1.0 m long, and S/ must see his paintbrush pass over 
the top of stick M and not paint a line on it. In other words, after the same 
event, one boy sees a painted line on a stick, while the other does not see 
such a line on that same stick! 


Einstein’s first postulate requires that the laws of physics (as, for example, 
applied to painting) predict that S and S/, who are both in inertial frames, 
make the same observations; that is, S and S/ must either both see a line 
painted on stick M, or both not see that line. We are therefore forced to 
conclude our original assumption that S saw a line painted below the top of 
his stick was wrong! Instead, S finds the line painted right at the 100-cm 
mark on M. Then both boys will agree that a line is painted on M, and they 
will also agree that both sticks are exactly 1 m long. We conclude then that 


measurements of a transverse length must be the same in different inertial 
frames. 


Example: 

Calculating Length Contraction 

Suppose an astronaut, such as the twin in the twin paradox discussion, 
travels so fast that y = 30.00. (a) The astronaut travels from Earth to the 
nearest star system, Alpha Centauri, 4.300 light years (ly) away as 
measured by an earthbound observer. How far apart are Earth and Alpha 
Centauri as measured by the astronaut? (b) In terms of c, what is the 
astronaut’s velocity relative to Earth? You may neglect the motion of Earth 
relative to the sun ((link]). 


a At Do Alpha 


Earth Centauri 
Lo , 
Lo 
v= —_ 
At (a) 


& Ar 


Centauri 


(a) The earthbound observer measures the proper distance between 
Earth and Alpha Centauri. (b) The astronaut observes a length 
contraction because Earth and Alpha Centauri move relative to her 


ship. She can travel this shorter distance in a smaller time (her proper 
time) without exceeding the speed of light. 


Strategy 

First, note that a light year (ly) is a convenient unit of distance on an 
astronomical scale—it is the distance light travels in a year. For part (a), 
the 4.300-ly distance between Alpha Centauri and Earth is the proper 
distance Lo, because it is measured by an earthbound observer to whom 
both stars are (approximately) stationary. To the astronaut, Earth and Alpha 
Centauri are moving past at the same velocity, so the distance between 
them is the contracted length L. In part (b), we are given 7, so we can find 
v by rearranging the definition of to express v in terms of c. 

Solution for (a) 

For part (a): 


a. Identify the knowns: Lp = 4.300 ly; vy = 30.00. 
b. Identify the unknown: L. 
c. Express the answer as an equation: L = at 


d. Do the calculation: 
Equation: 


i ee 
‘i: 


4.300 ly 
30.00 


= 0.1433 ly. 


Solution for (b) 
For part (b): 


a. Identify the known: y = 30.00. 

b. Identify the unknown: v in terms of c. 

c. Express the answer as an equation. Start with: 
Equation: 


Then solve for the unknown v/c by first squaring both sides and then 


rearranging: 

Equation: 
oe 1 
or ee 
ve il 
= ae 
Dees es 
c —- 1 a . 


d. Do the calculation: 


Equation: 
2 = 1-5 
= aa (30.00)? oy 

= 0.99944 
or 
Equation: 

v = 0.9994 c. 

Significance 


Remember not to round off calculations until the final answer, or you could 
get erroneous results. This is especially true for special relativity 
calculations, where the differences might only be revealed after several 
decimal places. The relativistic effect is large here (y = 30.00), and we 
see that v is approaching (not equaling) the speed of light. Because the 


distance as measured by the astronaut is so much smaller, the astronaut can 
travel it in much less time in her frame. 


People traveling at extremely high velocities could cover very large 
distances (thousands or even millions of light years) and age only a few 
years on the way. However, like emigrants in past centuries who left their 
home, these people would leave the Earth they know forever. Even if they 
returned, thousands to millions of years would have passed on Earth, 
obliterating most of what now exists. There is also a more serious practical 
obstacle to traveling at such velocities; immensely greater energies would 
be needed to achieve such high velocities than classical physics predicts can 
be attained. This will be discussed later in the chapter. 


Why don’t we notice length contraction in everyday life? The distance to 
the grocery store does not seem to depend on whether we are moving or 


a ; 2 = 
not. Examining the equation L = Lo / 1 — 4, we see that at low velocities 


(v<<c), the lengths are nearly equal, which is the classical expectation. 
But length contraction is real, if not commonly experienced. For example, a 
charged particle such as an electron traveling at relativistic velocity has 
electric field lines that are compressed along the direction of motion as seen 
by a stationary observer ([link]). As the electron passes a detector, such as a 
coil of wire, its field interacts much more briefly, an effect observed at 
particle accelerators such as the 3-km-long Stanford Linear Accelerator 
(SLAC). In fact, to an electron traveling down the beam pipe at SLAC, the 
accelerator and Earth are all moving by and are length contracted. The 
relativistic effect is so great that the accelerator is only 0.5 m long to the 
electron. It is actually easier to get the electron beam down the pipe, 
because the beam does not have to be as precisely aimed to get down a 
short pipe as it would to get down a pipe 3 km long. This, again, is an 
experimental verification of the special theory of relativity. 


The electric field lines of a high-velocity charged 
particle are compressed along the direction of 
motion by length contraction, producing an 
observably different signal as the particle goes 
through a coil. 


Note: 
Exercise: 


Problem: 


Check Your Understanding A particle is traveling through Earth’s 
atmosphere at a speed of 0.750c. To an earthbound observer, the 
distance it travels is 2.50 km. How far does the particle travel as 
viewed from the particle’s reference frame? 


Solution: 


2 
L=Iy/1- & = (2.50km)y1— “5 = 1.65km 


Summary 


e All observers agree upon relative speed. 

e Distance depends on an observer’s motion. Proper length Lg is the 
distance between two points measured by an observer who is at rest 
relative to both of the points. 

e Length contraction is the decrease in observed length of an object from 
its proper length LZ to length L when its length is observed in a 
reference frame where it is traveling at speed v. 

e The proper length is the longest measurement of any length interval. 
Any observer who is moving relative to the system being observed 
measures a length shorter than the proper length. 


Conceptual Questions 


Exercise: 
Problem: 
To whom does an object seem greater in length, an observer moving 


with the object or an observer moving relative to the object? Which 
observer measures the object’s proper length? 


Solution: 
The length of an object is greatest to an observer who is moving with 
the object, and therefore measures its proper length. 
Exercise: 
Problem: 
Relativistic effects such as time dilation and length contraction are 


present for cars and airplanes. Why do these effects seem strange to 
us? 


Exercise: 


Problem: 


Suppose an astronaut is moving relative to Earth at a significant 
fraction of the speed of light. (a) Does he observe the rate of his clocks 
to have slowed? (b) What change in the rate of earthbound clocks does 
he see? (c) Does his ship seem to him to shorten? (d) What about the 
distance between two stars that lie in the direction of his motion? (e) 
Do he and an earthbound observer agree on his velocity relative to 
Earth? 


Solution: 


a. No, not within the astronaut’s own frame of reference. b. He sees 
Earth clocks to be in their rest frame moving by him, and therefore 
sees them slowed. c. No, not within the astronaut’s own frame of 
reference. d. Yes, he measures the distance between the two stars to be 
shorter. e. The two observers agree on their relative speed. 


Problems 


Exercise: 


Problem: 


A spaceship, 200 m long as seen on board, moves by the Earth at 
0.970c. What is its length as measured by an earthbound observer? 


Solution: 


48.6 m 
Exercise: 
Problem: 
How fast would a 6.0 m-long sports car have to be going past you in 
order for it to appear only 5.5 m long? 


Exercise: 


Problem: 


(a) How far does the muon in [link] travel according to the earthbound 
observer? (b) How far does it travel as viewed by an observer moving 
with it? Base your calculation on its velocity relative to the Earth and 
the time it lives (proper time). (c) Verify that these two distances are 
related through length contraction y = 3.20. 


Solution: 


Using the values given in [link]: a. 1.39 km; b. 0.433 km; c. 0.433 km 
Exercise: 

Problem: 

(a) How long would the muon in [link] have lived as observed on 


Earth if its velocity was 0.0500c? (b) How far would it have traveled 
as observed on Earth? (c) What distance is this in the muon’s frame? 


Exercise: 


Problem: 


Unreasonable Results A spaceship is heading directly toward Earth at 
a velocity of 0.800c. The astronaut on board claims that he can send a 
canister toward the Earth at 1.20c relative to Earth. (a) Calculate the 
velocity the canister must have relative to the spaceship. (b) What is 
unreasonable about this result? (c) Which assumptions are 
unreasonable or inconsistent? 


Solution: 


a. 10.0c; b. The resulting speed of the canister is greater than c, an 
impossibility. c. It is unreasonable to assume that the canister will 
move toward the earth at 1.20c. 


Glossary 


length contraction 
decrease in observed length of an object from its proper length Li to 
length L when its length is observed in a reference frame where it is 
traveling at speed v 


proper length 
Lo; the distance between two points measured by an observer who is at 
rest relative to both of the points; for example, earthbound observers 
measure proper length when measuring the distance between two 
points that are stationary relative to Earth 


The Lorentz Transformation 


e Describe the Galilean transformation of classical mechanics, relating 
the position, time, velocities, and accelerations measured in different 
inertial frames 

¢ Derive the corresponding Lorentz transformation equations, which, in 
contrast to the Galilean transformation, are consistent with special 
relativity 

e Explain the Lorentz transformation and many of the features of 
relativity in terms of four-dimensional space-time 


We have used the postulates of relativity to examine, in particular examples, 
how observers in different frames of reference measure different values for 
lengths and the time intervals. We can gain further insight into how the 
postulates of relativity change the Newtonian view of time and space by 
examining the transformation equations that give the space and time 
coordinates of events in one inertial reference frame in terms of those in 
another. We first examine how position and time coordinates transform 
between inertial frames according to the view in Newtonian physics. Then 
we examine how this has to be changed to agree with the postulates of 
relativity. Finally, we examine the resulting Lorentz transformation 
equations and some of their consequences in terms of four-dimensional 
space-time diagrams, to support the view that the consequences of special 
relativity result from the properties of time and space itself, rather than 
electromagnetism. 


The Galilean Transformation Equations 


An event is specified by its location and time (x, y, z, t) relative to one 
particular inertial frame of reference S. As an example, (x, y, z, t) could 
denote the position of a particle at time t, and we could be looking at these 
positions for many different times to follow the motion of the particle. 
Suppose a second frame of reference S/ moves with velocity v with respect 
to the first. For simplicity, assume this relative velocity is along the x-axis. 
The relation between the time and coordinates in the two frames of 
reference is then 

Equation: 


zr=azeu, y=yl, z= Zi. 


Implicit in these equations is the assumption that time measurements made 
by observers in both S and S/ are the same. That is, 
Equation: 


t=tl. 


These four equations are known collectively as the Galilean 
transformation. 


We can obtain the Galilean velocity and acceleration transformation 
equations by differentiating these equations with respect to time. We use u 
for the velocity of a particle throughout this chapter to distinguish it from v, 
the relative velocity of two reference frames. Note that, for the Galilean 
transformation, the increment of time used in differentiating to calculate the 
particle velocity is the same in both frames, dt = dt/. Differentiation yields 
Equation: 


/ / / 
Ug =Uz TV, UW = Uy, Uz =U, 


and 
Equation: 

a 26 lt 
Ay =Q,, a, = a,. 


Ay = az; 


We denote the velocity of the particle by u rather than v to avoid confusion 
with the velocity v of one frame of reference with respect to the other. 
Velocities in each frame differ by the velocity that one frame has as seen 
from the other frame. Observers in both frames of reference measure the 
same value of the acceleration. Because the mass is unchanged by the 
transformation, and distances between points are uncharged, observers in 
both frames see the same forces F’ = ma acting between objects and the 
same form of Newton’s second and third laws in all inertial frames. The 
laws of mechanics are consistent with the first postulate of relativity. 


The Lorentz Transformation Equations 


The Galilean transformation nevertheless violates Einstein’s postulates, 
because the velocity equations state that a pulse of light moving with speed 
c along the x-axis would travel at speed c — v in the other inertial frame. 
Specifically, the spherical pulse has radius r = ct at time ¢ in the unprimed 
frame, and also has radius r/= ct/ at time ¢/ in the primed frame. 
Expressing these relations in Cartesian coordinates gives 

Equation: 


a? + y? 4+ 22-44? = 


2 2 2 2 


The left-hand sides of the two expressions can be set equal because both are 
zero. Because y = y/ and z = z/, we obtain 
Equation: 


2 12 


2 
ger =x i 


— ct 


This cannot be satisfied for nonzero relative velocity v of the two frames if 
we assume the Galilean transformation results in ¢ = ¢/ with x = x/+ vt. 


To find the correct set of transformation equations, assume the two 
coordinate systems S and S’ in [link]. First suppose that an event occurs at 
(x, 0, 0, t/) in Sand at (x, 0, 0, t) in S, as depicted in the figure. 


Zz z' 


An event occurs at (x, 0, 0, f) in S and at («/, 0, 0, ¢/) in $7. 
The Lorentz transformation equations relate events in the 
two systems. 


Suppose that at the instant that the origins of the coordinate systems in S 
and S/ coincide, a flash bulb emits a spherically spreading pulse of light 
starting from the origin. At time t, an observer in S finds the origin of S/ to 
be at x = vt. With the help of a friend in S, the S/ observer also measures 
the distance from the event to the origin of S/ and finds it to be 

xl,/1 — v*/c?. This follows because we have already shown the postulates 
of relativity to imply length contraction. Thus the position of the event in S 
is 

Equation: 


r = vt +an/1—v?/e2 


and 
Equation: 


The postulates of relativity imply that the equation relating distance and 
time of the spherical wave front: 
Equation: 


a +y?+27—c7t? =0 


must apply both in terms of primed and unprimed coordinates, which was 
shown above to lead to [link]: 
Equation: 


eo? — ct? = aftr. 
We combine this with the equation relating x and x/ to obtain the relation 


between t and t/: 
Equation: 


The equations relating the time and position of the events as seen in S are 
then 


Equation: 
t t+vat/c? 
4/0? /e? 
r= xzl+utl 
a/ 1—v?/c? 
y= yl 


Zz = ai. 


This set of equations, relating the position and time in the two inertial 
frames, is known as the Lorentz transformation. They are named in honor 
of H.A. Lorentz (1853-1928), who first proposed them. Interestingly, he 
justified the transformation on what was eventually discovered to be a 
fallacious hypothesis. The correct theoretical basis is Einstein’s special 
theory of relativity. 


The reverse transformation expresses the variables in S in terms of those in 
S/. Simply interchanging the primed and unprimed variables and 
substituting gives: 


Equation: 
4p Sa t—vx/c? 
A /T=w? 7? 
rh = z—vt 
VJ1—v2/e2 
yl Y 
zi Zz 
Example: 


Using the Lorentz Transformation for Time 

Spacecraft S/ is on its way to Alpha Centauri when Spacecraft S passes it 
at relative speed c/2. The captain of S/ sends a radio signal that lasts 1.2 s 
according to that ship’s clock. Use the Lorentz transformation to find the 
time interval of the signal measured by the communications officer of 
spaceship S. 

Solution 


a. Identify the known: At/= to/—ty/= 1.2s; Avl= xlg—x2= 0. 

b. Identify the unknown: At = tz — fy. 

c. Express the answer as an equation. The time signal starts as (x/, t1/) 
and stops at (x/, t2/). Note that the x/ coordinate of both events is the 
same because the clock is at rest in S/. Write the first Lorentz 
transformation equation in terms of At = ty — ty, Av = 29 — 1, 
and similarly for the primed coordinates, as: 


Equation: 


ome Ati+vAz!/c? | 


Because the position of the clock in S/ is fixed, Ax/= 0, and the time 
interval At becomes: 


Equation: 

Atl 

oN pe 
y2 
1-3 
d. Do the calculation. 
With At/= 1.2 this gives: 
Equation: 
2 
At = ihe 


Note that the Lorentz transformation reproduces the time dilation 
equation. 


Example: 

Using the Lorentz Transformation for Length 

A surveyor measures a street to be L = 100 m long in Earth frame S. Use 

the Lorentz transformation to obtain an expression for its length measured 

from a spaceship S/, moving by at speed 0.20c, assuming the x coordinates 
of the two frames coincide at time t = 0. 

Solution 


a. Identify the known: LD = 100 m; v = 0.20c; Ar = 0. 

b. Identify the unknown: L/. 

c. Express the answer as an equation. The surveyor in frame S has 
measured the two ends of the stick simultaneously, and found them at 
rest at 2 and x; a distance L = x2 — x; = 100m apart. The 
spaceship crew measures the simultaneous location of the ends of the 
sticks in their frame. To relate the lengths recorded by observers in S/ 
and S, respectively, write the second of the four Lorentz 
transformation equations as: 


Equation: 
LI UE 21 —vt 
SES) = 4G — -— —- — 
2 ; J1—v?/c? / 1-v?/c? 


Yat Se L 
VJ/1—v?/e VJ1—v?/e2 ; 


d. Do the calculation. Because x2 — x; = 100m, the length of the 
moving stick is equal to: 
Equation: 


Lr = (100 m)y/1 — v?/c? 


= (100 m) 1 = (0:20) 
= 98.0 m. 


Note that the Lorentz transformation gave the length contraction 
equation for the street. 


Example: 

Lorentz Transformation and Simultaneity 

The observer shown in [link] standing by the railroad tracks sees the two 
bulbs flash simultaneously at both ends of the 26 m long passenger car 
when the middle of the car passes him at a speed of c/2. Find the separation 


in time between when the bulbs flashed as seen by the train passenger 
seated in the middle of the car. 


An person watching a train go by observes two bulbs flash 
simultaneously at opposite ends of a passenger car. There is another 
passenger inside of the car observing the same flashes but from a 
different perspective. 


Solution 


a. Identify the known: At = 0. 
Note that the spatial separation of the two events is between the two 
lamps, not the distance of the lamp to the passenger. 

b. Identify the unknown: At/= t5 — t}. 
Again, note that the time interval is between the flashes of the lamps, 
not between arrival times for reaching the passenger. 

c. Express the answer as an equation: 
Equation: 


— Atr+vAz/c? 


4/1 —v2/c? ) 


At 


d. Do the calculation: 
Equation: 


At/+ £(26 m)/c? 


a Terre 
a 26m/s __ 26 m/s 
ING) Ss ee — ~ (300x108 m/s) 
At = 4.33% 105" 5, 


Significance 

The sign indicates that the event with the larger z2/, namely, the flash from 
the right, is seen to occur first in the S/ frame, as found earlier for this 
example, so that ft) < fj. 


Space-time 


Relativistic phenomena can be analyzed in terms of events in a four- 
dimensional space-time. When phenomena such as the twin paradox, time 
dilation, length contraction, and the dependence of simultaneity on relative 
motion are viewed in this way, they are seen to be characteristic of the 
nature of space and time, rather than specific aspects of electromagnetism. 


In three-dimensional space, positions are specified by three coordinates on a 
set of Cartesian axes, and the displacement of one point from another is 
given by: 

Equation: 


(Az, Ay, Az) = (%2 — £1, y2 — Y1, 22 — 21). 
The distance Ar between the points is 
Equation: 
Ar? = (Az)’ + (Ay)? + (Az). 


The distance Ar is invariant under a rotation of axes. If a new set of 
Cartesian axes rotated around the origin relative to the original axes are 


used, each point in space will have new coordinates in terms of the new 
axes, but the distance Ar/ given by 
Equation: 


Arr'= (Aas)? + (Ayn)? + (Az)?. 


That has the same value that Ar? had. Something similar happens with the 
Lorentz transformation in space-time. 


Define the separation between two events, each given by a set of x, y, z, and 
ct along a four-dimensional Cartesian system of axes in space-time, as 
Equation: 


(Az, Ay, Az, cAt) = (42 — £1, y2 — 91, Z2 — 21, C(t2 — t1)). 


Also define the space-time interval As between the two events as 
Equation: 


As? = (Ax)* + (Ay)? + (Az)? — (cAt)’. 


If the two events have the same value of ct in the frame of reference 
considered, As would correspond to the distance Ar between points in 
space. 


The path of a particle through space-time consists of the events (x, y, Z, ct) 
specifying a location at each time of its motion. The path through space- 
time is called the world line of the particle. The world line of a particle that 
remains at rest at the same location is a straight line that is parallel to the 
time axis. If the particle moves at constant velocity parallel to the x-axis, its 
world line would be a sloped line x = vt, corresponding to a simple 
displacement vs. time graph. If the particle accelerates, its world line is 
curved. The increment of s along the world line of the particle is given in 
differential form as 

Equation: 


ds* = (dx)? + (dy)? + (dz)? — c?(dt)’. 


Just as the distance Ar is invariant under rotation of the space axes, the 
space-time interval: 
Equation: 


As? = (Ax)* + (Ay)? + (Az)? — (cAt)’. 


is invariant under the Lorentz transformation. This follows from the 
postulates of relativity, and can be seen also by substitution of the previous 
Lorentz transformation equations into the expression for the space-time 
interval: 

Equation: 


As? = (Aa)? + (Ay)? + (Az)? - (ct)? 


2 Fay me a 
— ({ Aal+vAt 2 2? i) 
= ( aia ) + (Ayr) + (Azv) (= ) 
— (Az)? + (Ayr)? a (Az)? = (cAt)’ 
= As/?, 


In addition, the Lorentz transformation changes the coordinates of an event 
in time and space similarly to how a three-dimensional rotation changes old 
coordinates into new coordinates: 

Equation: 


Lorentz transformation Axis— rotation around z-axis 


(x, t coordinates): (x, y coordinates): 
xl = (y)x + (—6y)ct x!= (cos 0)x + (sin 0)y 
ctl= (—By)x + (y)ct yl= (—sin 6)x + (cos 0)y 


where y = rer BU] e, 


Lorentz transformations can be regarded as generalizations of spatial 
rotations to space-time. However, there are some differences between a 
three-dimensional axis rotation and a Lorentz transformation involving the 
time axis, because of differences in how the metric, or rule for measuring 
the displacements Ar and As, differ. Although Ar is invariant under 
spatial rotations and As is invariant also under Lorentz transformation, the 
Lorentz transformation involving the time axis does not preserve some 
features, such as the axes remaining perpendicular or the length scale along 
each axis remaining the same. 


Note that the quantity As? can have either sign, depending on the 
coordinates of the space-time events involved. For pairs of events that give 
it a negative sign, it is useful to define Ar? as —As?. The significance of 
Az as just defined follows by noting that in a frame of reference where the 
two events occur at the same location, we have Az = Ay = Az = O and 
therefore (from the equation for As? = —Ar’): 

Equation: 


Ar? = —As? = (At)’. 


Therefore Az is the time interval At in the frame of reference where both 
events occur at the same location. It is the same interval of proper time 
discussed earlier. It also follows from the relation between As and that Ar 
that because As is Lorentz invariant, the proper time is also Lorentz 
invariant. All observers in all inertial frames agree on the proper time 
intervals between the same two events. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Show that if a time increment dt elapses 
for an observer who sees the particle moving with velocity v, it 
corresponds to a proper time particle increment for the particle of 

dr = Yydr. 


Solution: 


Start with the definition of the proper time increment: 


dr = 4/—(ds)*/c2 = ,/dt? — (dx? + dx? + dx?)/c?. 


where (dx, dy, dx, cdt) are measured in the inertial frame of an 
observer who does not necessarily see that particle at rest. This 
therefore becomes 


dr = 1f—(ds)?/c? = dt? — | (dx)? + (dy)” + (dz)? | /e2 
Bb ere) (ore 
= dt/1—v?/2 


di = Wada. 


The light cone 


We can deal with the difficulty of visualizing and sketching graphs in four 
dimensions by imagining the three spatial coordinates to be represented 
collectively by a horizontal axis, and the vertical axis to be the ct-axis. 
Starting with a particular event in space-time as the origin of the space-time 
graph shown, the world line of a particle that remains at rest at the initial 
location of the event at the origin then is the time axis. Any plane through 
the time axis parallel to the spatial axes contains all the events that are 
simultaneous with each other and with the intersection of the plane and the 
time axis, as seen in the rest frame of the event at the origin. 


It is useful to picture a light cone on the graph, formed by the world lines of 
all light beams passing through the origin event A, as shown in [link]. The 
light cone, according to the postulates of relativity, has sides at an angle of 
45° if the time axis is measured in units of ct, and, according to the 
postulates of relativity, the light cone remains the same in all inertial 
frames. Because the event A is arbitrary, every point in the space-time 
diagram has a light cone associated with it. 


The light cone consists of all the 
world lines followed by light from 
the event A at the vertex of the 
cone. 


Consider now the world line of a particle through space-time. Any world 
line outside of the cone, such as one passing from A through C, would 
involve speeds greater than c, and would therefore not be possible. Events 
such as C that lie outside the light cone are said to have a space-like 
separation from event A. They are characterized by: 

Equation: 


Asie =(r4 —2p)* + (x4 — ap)? + (x4 —2p)’ - (cAt)? > 0. 


An event like B that lies in the upper cone is reachable without exceeding 
the speed of light in vacuum, and is characterized by 
Equation: 


AGB = (24 ae)" + (x4 payee (x4 —2p)° = (cAt)? <0. 


The event is said to have a time-like separation from A. Time-like events 
that fall into the upper half of the light cone occur at greater values of t than 
the time of the event A at the vertex and are in the future relative to A. 
Events that have time-like separation from A and fall in the lower half of 
the light cone are in the past, and can affect the event at the origin. The 
region outside the light cone is labeled as neither past nor future, but rather 
as “elsewhere.” 


For any event that has a space-like separation from the event at the origin, it 
is possible to choose a time axis that will make the two events occur at the 
same time, so that the two events are simultaneous in some frame of 
reference. Therefore, which of the events with space-like separation comes 
before the other in time also depends on the frame of reference of the 
observer. Since space-like separations can be traversed only by exceeding 
the speed of light; this violation of which event can cause the other provides 
another argument for why particles cannot travel faster than the speed of 
light, as well as potential material for science fiction about time travel. 
Similarly for any event with time-like separation from the event at the 
origin, a frame of reference can be found that will make the events occur at 
the same location. Because the relations 

Equation: 


Aso = (aa — 2p)’ + (a4 — 2p)’ + (a4 — 2p)’ — (cAt)? > 0 


and 
Equation: 


UN Ge =(r4 —2p)° + (x4 —ap)?+ (x4 =o) = (cAt)? < 0. 


are Lorentz invariant, whether two events are time-like and can be made to 
occur at the same place or space-like and can be made to occur at the same 
time is the same for all observers. All observers in different inertial frames 
of reference agree on whether two events have a time-like or space-like 
separation. 


The twin paradox seen in space-time 


The twin paradox discussed earlier involves an astronaut twin traveling at 
near light speed to a distant star system, and returning to Earth. Because of 
time dilation, the space twin is predicted to age much less than the 
earthbound twin. This seems paradoxical because we might have expected 
at first glance for the relative motion to be symmetrical and naively thought 
it possible to also argue that the earthbound twin should age less. 


To analyze this in terms of a space-time diagram, assume that the origin of 
the axes used is fixed in Earth. The world line of the earthbound twin is 
then along the time axis. 


The world line of the astronaut twin, who travels to the distant star and then 
returns, must deviate from a straight line path in order to allow a return trip. 
As seen in [link], the circumstances of the two twins are not at all 
symmetrical. Their paths in space-time are of manifestly different length. 
Specifically, the world line of the earthbound twin has length 2cAt, which 
then gives the proper time that elapses for the earthbound twin as 2A¢. The 
distance to the distant star system is Ax = vAt. The proper time that 
elapses for the space twin is 2A7 where 


Equation: 
c?Ar? = —As* = (cAt)? — (Az)?. 
This is considerably shorter than the proper time for the earthbound twin by 


the ratio 
Equation: 


consistent with the time dilation formula. The twin paradox is therefore 
seen to be no paradox at all. The situation of the two twins is not 
symmetrical in the space-time diagram. The only surprise is perhaps that 
the seemingly longer path on the space-time diagram corresponds to the 
smaller proper time interval, because of how Az and As depend on Ax 
and At. 


Earth twin 


a! 


cAt 


Space twin 


The space twin and the earthbound twin, in the 
twin paradox example, follow world lines of 


different length through space-time. 


Lorentz transformations in space-time 


We have already noted how the Lorentz transformation leaves 
Equation: 


As* = (Ax)? + (Ay)? + (Az)? — (cAt)? 


unchanged and corresponds to a rotation of axes in the four-dimensional 
space-time. If the S and S/ frames are in relative motion along their shared 
x-direction the space and time axes of S/ are rotated by an angle a as seen 
from S, in the way shown in shown in [link], where: 

Equation: 


Vv 
tana = — = £. 
Cc 


This differs from a rotation in the usual three-dimension sense, insofar as 
the two space-time axes rotate toward each other symmetrically in a 
scissors-like way, as shown. The rotation of the time and space axes are 
both through the same angle. The mesh of dashed lines parallel to the two 
axes show how coordinates of an event would be read along the primed 
axes. This would be done by following a line parallel to the x/ and one 
parallel to the t/-axis, as shown by the dashed lines. The length scale of 
both axes are changed by: 

Equation: 


1 2 1 2 
ctl= ct Cae tl=7 Te : 
1 — ~? 1 — £? 


The line labeled “uv = c” at 45° to the x-axis corresponds to the edge of the 
light cone, and is unaffected by the Lorentz transformation, in accordance 
with the second postulate of relativity. The “v = c” line, and the light cone 
it represents, are the same for both the S and S/ frame of reference. 


The Lorentz transformation results in new space 
and time axes rotated in a scissors-like way 
with respect to the original axes. 


Simultaneity 


Simultaneity of events at separated locations depends on the frame of 
reference used to describe them, as given by the scissors-like “rotation” to 
new time and space coordinates as described. If two events have the same t 
values in the unprimed frame of reference, they need not have the same 
values measured along the ct/-axis, and would then not be simultaneous in 
the primed frame. 


As a specific example, consider the near-light-speed train in which flash 
lamps at the two ends of the car have flashed simultaneously in the frame of 
reference of an observer on the ground. The space-time graph is shown 
[link]. The flashes of the two lamps are represented by the dots labeled 
“Left flash lamp” and “Right flash lamp” that lie on the light cone in the 
past. The world line of both pulses travel along the edge of the light cone to 
arrive at the observer on the ground simultaneously. Their arrival is the 
event at the origin. They therefore had to be emitted simultaneously in the 
unprimed frame, as represented by the point labeled as t(both). But time is 
measured along the ct/-axis in the frame of reference of the observer seated 
in the middle of the train car. So in her frame of reference, the emission 
event of the bulbs labeled as ¢/ (left) and ¢/ (right) were not simultaneous. 


ct 


The train example revisited. The flashes occur at the same time 
t(both) along the time axis of the ground observer, but at different 
times, along the ¢/ time axis of the passenger. 


In terms of the space-time diagram, the two observers are merely using 
different time axes for the same events because they are in different inertial 
frames, and the conclusions of both observers are equally valid. As the 


analysis in terms of the space-time diagrams further suggests, the property 
of how simultaneity of events depends on the frame of reference results 
from the properties of space and time itself, rather than from anything 
specifically about electromagnetism. 


Summary 


e The Galilean transformation equations describe how, in classical 
nonrelativistic mechanics, the position, velocity, and accelerations 
measured in one frame appear in another. Lengths remain unchanged 
and a single universal time scale is assumed to apply to all inertial 
frames. 

e Newton’s laws of mechanics obey the principle of having the same 
form in all inertial frames under a Galilean transformation, given by 
Equation: 


CS, YS ee a ee. 


The concept that times and distances are the same in all inertial frames 
in the Galilean transformation, however, is inconsistent with the 
postulates of special relativity. 

¢ The relativistically correct Lorentz transformation equations are 
Equation: 


Lorentz transformation Inverse Lorentz transformation 


— ttvat/c? is: t—v2x/c? 
Jee Jie 
c= xzi+uvtl r= x—vt 
Je yaaa 
y=yl yi=y 


We can obtain these equations by requiring an expanding spherical 
light signal to have the same shape and speed of growth, c, in both 
reference frames. 


¢ Relativistic phenomena can be explained in terms of the geometrical 
properties of four-dimensional space-time, in which Lorentz 
transformations correspond to rotations of axes. 

e The Lorentz transformation corresponds to a space-time axis rotation, 
similar in some ways to a rotation of space axes, but in which the 
invariant spatial separation is given by As rather than distances Ar, 
and that the Lorentz transformation involving the time axis does not 
preserve perpendicularity of axes or the scales along the axes. 

e The analysis of relativistic phenomena in terms of space-time diagrams 
supports the conclusion that these phenomena result from properties of 
space and time itself, rather than from the laws of electromagnetism. 


Problems 


Exercise: 


Problem: 


Describe the following physical occurrences as events, that is, in the 
form (x, y, Z, t): (a) A postman rings a doorbell of a house precisely at 
noon. (b) At the same time as the doorbell is rung, a slice of bread 
pops out of a toaster that is located 10 m from the door in the east 
direction from the door. (c) Ten seconds later, an airplane arrives at the 
airport, which is 10 km from the door in the east direction and 2 km to 
the south. 


Exercise: 
Problem: 
Describe what happens to the angle a = tan (v/c), and therefore to 


the transformed axes in [link], as the relative velocity v of the S and S/ 
frames of reference approaches c. 


Solution: 


The angle a approaches 45°, and the ¢/- and x/-axes rotate toward the 
edge of the light cone. 


Exercise: 


Problem: 


Describe the shape of the world line on a space-time diagram of (a) an 

object that remains at rest at a specific position along the x-axis; (b) an 
object that moves at constant velocity u in the x-direction; (c) an object 
that begins at rest and accelerates at a constant rate of in the positive x- 
direction. 


Exercise: 
Problem: 
A man standing still at a train station watches two boys throwing a 
baseball in a moving train. Suppose the train is moving east with a 
constant speed of 20 m/s and one of the boys throws the ball with a 
speed of 5 m/s with respect to himself toward the other boy, who is 5 


m west from him. What is the velocity of the ball as observed by the 
man on the station? 


Solution: 


15 m/s east 
Exercise: 
Problem: 
When observed from the sun at a particular instant, Earth and Mars 
appear to move in opposite directions with speeds 108,000 km/h and 


86,871 km/h, respectively. What is the speed of Mars at this instant 
when observed from Earth? 


Exercise: 
Problem: 
A man is running on a straight road perpendicular to a train track and 
away from the track at a speed of 12 m/s. The train is moving with a 


speed of 30 m/s with respect to the track. What is the speed of the man 
with respect to a passenger sitting at rest in the train? 


Solution: 


32 m/s 
Exercise: 


Problem: 


A man is running on a straight road that makes 30° with the train 
track. The man is running in the direction on the road that is away 
from the track at a speed of 12 m/s. The train is moving with a speed 
of 30 m/s with respect to the track. What is the speed of the man with 
respect to a passenger sitting at rest in the train? 


Exercise: 


Problem: 


In a frame at rest with respect to the billiard table, a billiard ball of 
mass m moving with speed v strikes another billiard ball of mass m at 
rest. The first ball comes to rest after the collision while the second 
ball takes off with speed v in the original direction of the motion of the 
first ball. This shows that momentum is conserved in this frame. (a) 
Now, describe the same collision from the perspective of a frame that 
is moving with speed v in the direction of the motion of the first ball. 
(b) Is the momentum conserved in this frame? 


Solution: 


a. The second ball approaches with velocity —v and comes to rest while 
the other ball continues with velocity —v; b. This conserves 
momentum. 


Exercise: 


Problem: 


In a frame at rest with respect to the billiard table, two billiard balls of 
Same mass m are moving toward each other with the same speed v. 
After the collision, the two balls come to rest. (a) Show that 
momentum is conserved in this frame. (b) Now, describe the same 
collision from the perspective of a frame that is moving with speed v in 
the direction of the motion of the first ball. (c) Is the momentum 
conserved in this frame? 


Exercise: 


Problem: 


In a frame S, two events are observed: event 1: a pion is created at rest 
at the origin and event 2: the pion disintegrates after time 7. Another 
observer in a frame S/ is moving in the positive direction along the 
positive x-axis with a constant speed v and observes the same two 
events in his frame. The origins of the two frames coincide at 

t = t/= 0. (a) Find the positions and timings of these two events in the 
frame S/ (a) according to the Galilean transformation, and (b) 
according to the Lorentz transformation. 


Solution: 


t= 0; x1/= 0; ty/= 0; 2 1/= 0; 


d. ) ° is T : —™ —UT 
t= 7; tyt=0' 8! ee 8I= Fe 


Glossary 


event 
occurrence in space and time specified by its position and time 
coordinates (x, y, Z, t) measured relative to a frame of reference 


Galilean transformation 
relation between position and time coordinates of the same events as 
seen in different reference frames, according to classical mechanics 


Lorentz transformation 
relation between position and time coordinates of the same events as 
seen in different reference frames, according to the special theory of 
relativity 


world line 
path through space-time 


Relativistic Velocity Transformation 
By the end of this section, you will be able to: 


¢ Derive the equations consistent with special relativity for transforming 
velocities in one inertial frame of reference into another. 

e Apply the velocity transformation equations to objects moving at 
relativistic speeds. 

e Examine how the combined velocities predicted by the relativistic 
transformation equations compare with those expected classically. 


Remaining in place in a kayak in a fast-moving river takes effort. The river 
current pulls the kayak along. Trying to paddle against the flow can move 
the kayak upstream relative to the water, but that only accounts for part of 
its velocity relative to the shore. The kayak’s motion is an example of how 
velocities in Newtonian mechanics combine by vector addition. The 
kayak’s velocity is the vector sum of its velocity relative to the water and 
the water’s velocity relative to the riverbank. However, the relativistic 
addition of velocities is quite different. 


Velocity Transformations 


Imagine a car traveling at night along a straight road, as in [link]. The driver 
sees the light leaving the headlights at speed c within the car’s frame of 
reference. If the Galilean transformation applied to light, then the light from 
the car’s headlights would approach the pedestrian at a speed u = v + c, 
contrary to Einstein’s postulates. 


According to experimental results and the second postulate of 
relativity, light from the car’s headlights moves away from the car at 
speed c and toward the observer on the sidewalk at speed c. 


Both the distance traveled and the time of travel are different in the two 
frames of reference, and they must differ in a way that makes the speed of 
light the same in all inertial frames. The correct rules for transforming 
velocities from one frame to another can be obtained from the Lorentz 
transformation equations. 


Relativistic Transformation of Velocity 


Suppose an object P is moving at constant velocity u = (w/,, ui, uw) as 


measured in the S/ frame. The S/ frame is moving along its x/-axis at 
velocity v. In an increment of time dt/, the particle is displaced by dx/ 
along the z/-axis. Applying the Lorentz transformation equations gives the 
corresponding increments of time and displacement in the unprimed axes: 
Equation: 


dt = + (dt' + vdx'/c’) 
dx = y(dz' + vdt') 

dy dy’ 

2 de. 


The velocity components of the particle seen in the unprimed coordinate 
system are then 


Equation: 
dz __y(dei+udtt) ty + 
dt — = ¥(dtt+vudzi/c?) 1+4 @ 
ae. ees: 3 
f OS SaneETEY ~ Gg) 
dz dat 
tS SURES ~ By 


We thus obtain the equations for the velocity components of the object as 
seen in frame S: 
Equation: 


(mete), felt 8 (tel 
‘ l+vui,/c2}? % L+vul,/c2? J? * 1+ vul,/c? ) 


Compare this with how the Galilean transformation of classical mechanics 
says the velocities transform, by adding simply as vectors: 
Equation: 


/ / / 
Ug = Uz TU, Uy = Uy, Uz = Uz. 
When the relative velocity of the frames is much smaller than the speed of 


light, that is, when v < c, the special relativity velocity addition law 
reduces to the Galilean velocity law. When the speed v of S/ relative to S is 


comparable to the speed of light, the relativistic velocity addition law 
gives a much smaller result than the classical (Galilean) velocity addition 
does. 


Example: 

Velocity Transformation Equations for Light 

Suppose a spaceship heading directly toward Earth at half the speed of 
light sends a signal to us on a laser-produced beam of light ([link]). Given 
that the light leaves the ship at speed c as observed from the ship, calculate 
the speed at which it approaches Earth. 


laser light : 
yO 
a 
—P 
= 0.500¢ 


How fast does a light signal approach Earth if sent from a spaceship 
traveling at 0.500c? 


Strategy 

Because the light and the spaceship are moving at relativistic speeds, we 
cannot use simple velocity addition. Instead, we determine the speed at 
which the light approaches Earth using relativistic velocity addition. 
Solution 


a. Identify the knowns: v = 0.500c; u/= c. 
b. Identify the unknown: u. 


c. Express the answer as an equation: u = “4 


1 


d. Do the calculation: 
Equation: 


— PRE 

ees 

= 0.500c+c 
1: oe 

= (0.500+1)c 
( tg ) 


= C. 


Significance 

Relativistic velocity addition gives the correct result. Light leaves the ship 
at speed c and approaches Earth at speed c. The speed of light is 
independent of the relative motion of source and observer, whether the 
observer is on the ship or earthbound. 


Velocities cannot add to greater than the speed of light, provided that v is 
less than c and w/ does not exceed c. The following example illustrates that 
relativistic velocity addition is not as symmetric as classical velocity 
addition. 


Example: 

Relativistic Package Delivery 

Suppose the spaceship in the previous example approaches Earth at half 
the speed of light and shoots a canister at a speed of 0.750c ([link]). (a) At 
what velocity does an earthbound observer see the canister if it is shot 
directly toward Earth? (b) If it is shot directly away from Earth? 


u’ = 0.750c u’ = —0.750c 
ae ® > Yo > 
y“ ~~ —- p = 0.500e > P : v = 0.500c * 


Canister toward Earth Canister away from Earth 


A canister is fired at 0.7500c toward Earth or away from Earth. 


Strategy 

Because the canister and the spaceship are moving at relativistic speeds, 
we must determine the speed of the canister by an earthbound observer 
using relativistic velocity addition instead of simple velocity addition. 
Solution for (a) 


a. Identify the knowns: v = 0.500c; u/= 0.750c. 
b. Identify the unknown: u. 


é / 
c. Express the answer as an equation: uw = 


1+4° 


& 


d. Do the calculation: 
Equation: 
vtul 
oars 
0.500c+0.750c 
ie ee ee 


ic 


he = 


— J oUoe 


Solution for (b) 


a. Identify the knowns: v = 0.500c; u/= —0.750c. 
b. Identify the unknown: u. 


9 (} 
c. Express the answer as an equation: wu = 


144° 


G 


d. Do the calculation: 
Equation: 


= vtul 

Mees 

0.500c+(—0.750c) 
0.500c)(—0.750c 

1 4 ©:5006)(=0.750e) 


C 


= —0.400c. 


Significance 

The minus sign indicates a velocity away from Earth (in the opposite 
direction from v), which means the canister is heading toward Earth in part 
(a) and away in part (b), as expected. But relativistic velocities do not add 
as simply as they do classically. In part (a), the canister does approach 
Earth faster, but at less than the vector sum of the velocities, which would 
give 1.250c. In part (b), the canister moves away from Earth at a velocity 
of —0.400c, which is faster than the —0.250c expected classically. The 
differences in velocities are not even symmetric: In part (a), an observer on 
Earth sees the canister and the ship moving apart at a speed of 0.409c, and 
at a speed of 0.900c in part (b). 


Note: 
Exercise: 


Problem: 


Check Your Understanding Distances along a direction 
perpendicular to the relative motion of the two frames are the same in 
both frames. Why then are velocities perpendicular to the x-direction 
different in the two frames? 


Solution: 


Although displacements perpendicular to the relative motion are the 
same in both frames of reference, the time interval between events 
differ, and differences in dt and dt/ lead to different velocities seen 
from the two frames. 


Summary 


e With classical velocity addition, velocities add like regular numbers in 
one-dimensional motion: wu = v + w/, where v is the velocity between 


two observers, u is the velocity of an object relative to one observer, 
and wu/ is the velocity relative to the other observer. 

e Velocities cannot add to be greater than the speed of light. 

e Relativistic velocity addition describes the velocities of an object 
moving at a relativistic velocity. 


Problems 


Exercise: 
Problem: 
If two spaceships are heading directly toward each other at 0.800c, at 


what speed must a canister be shot from the first ship to approach the 
other at 0.999c as seen by the second ship? 


Exercise: 
Problem: 
Two planets are on a collision course, heading directly toward each 
other at 0.250c. A spaceship sent from one planet approaches the 


second at 0.750c as seen by the second planet. What is the velocity of 
the ship relative to the first planet? 


Solution: 


0.615c 
Exercise: 
Problem: 
When a missile is shot from one spaceship toward another, it leaves the 


first at 0.950c and approaches the other at 0.750c. What is the relative 
velocity of the two ships? 


Exercise: 


Problem: 


What is the relative velocity of two spaceships if one fires a missile at 
the other at 0.750c and the other observes it to approach at 0.950c? 


Solution: 


0.696c 
Exercise: 
Problem: 
Prove that for any relative velocity v between two observers, a beam of 


light sent from one to the other will approach at speed c (provided that 
v is less than c, of course). 


Exercise: 
Problem: 
Show that for any relative velocity v between two observers, a beam of 


light projected by one directly away from the other will move away at 
the speed of light (provided that v is less than c, of course). 


Solution: 


(Proof) 


Glossary 


classical (Galilean) velocity addition 
method of adding velocities when v< <c; velocities add like regular 
numbers in one-dimensional motion: u = v + wu/, where v is the 
velocity between two observers, u is the velocity of an object relative 
to one observer, and wu/ is the velocity relative to the other observer 


relativistic velocity addition 
method of adding velocities of an object moving at a relativistic speeds 


Doppler Effect for Light 
By the end of this section, you will be able to: 


e Explain the origin of the shift in frequency and wavelength of the observed 
wavelength when observer and source moved toward or away from each other 

e Derive an expression for the relativistic Doppler shift 

e Apply the Doppler shift equations to real-world examples 


As discussed in the chapter on sound, if a source of sound and a listener are moving farther 
apart, the listener encounters fewer cycles of a wave in each second, and therefore lower 
frequency, than if their separation remains constant. For the same reason, the listener 
detects a higher frequency if the source and listener are getting closer. The resulting 
Doppler shift in detected frequency occurs for any form of wave. For sound waves, 
however, the equations for the Doppler shift differ markedly depending on whether it is the 
source, the observer, or the air, which is moving. Light requires no medium, and the 
Doppler shift for light traveling in vacuum depends only on the relative speed of the 
observer and source. 


The Relativistic Doppler Effect 


Suppose an observer in S sees light from a source in S/ moving away at velocity v ([link]). 
The wavelength of the light could be measured within S/—for example, by using a mirror 
to set up standing waves and measuring the distance between nodes. These distances are 
proper lengths with S’ as their rest frame, and change by a factor ,/1 — v?/c? when 
measured in the observer’s frame S, where the ruler measuring the wavelength in S/ is seen 
as moving. 


observer source 


(a) 


c At v At 


(b) 


(a) When a light wave is emitted by a source fixed in the 
moving inertial frame S/, the observer in S sees the 
wavelength measured in S/. to be shorter by a factor 

,/1 — v?/c2. (b) Because the observer sees the source 
moving away within S, the wave pattern reaching the 
observer in S is also stretched by the factor 
(cAt + vAt)/ (cAt) = 1+ v/e. 


If the source were stationary in S, the observer would see a length cAt of the wave pattern 
in time At. But because of the motion of S/ relative to S, considered solely within S, the 
observer sees the wave pattern, and therefore the wavelength, stretched out by a factor of 


Equation: 


CA tperiod + vAtperiod = v 
CAtperiod c 


as illustrated in (b) of [link]. The overall increase from both effects gives 
Equation: 


dae Ae (14S) = de (t+ 2) ay 


where Agr- is the wavelength of the light seen by the source in S/ and Aops is the 
wavelength that the observer detects within S. 


Red Shifts and Blue Shifts 


The observed wavelength A,ps of electromagnetic radiation is longer (called a “red shift”) 
than that emitted by the source when the source moves away from the observer. Similarly, 
the wavelength is shorter (called a “blue shift”) when the source moves toward the 
observer. The amount of change is determined by 


Equation: 
1+2 
Aobs = Xs = a 


where A, is the wavelength in the frame of reference of the source, and v is the relative 

velocity of the two frames S and S/. The velocity v is positive for motion away from an 
observer and negative for motion toward an observer. In terms of source frequency and 
observed frequency, this equation can be written as 


Equation: 
1— 
fobs = fs ia 


Notice that the signs are different from those of the wavelength equation. 


a |e 


ale 


Example: 

Calculating a Doppler Shift 

Suppose a galaxy is moving away from Earth at a speed 0.825c. It emits radio waves with 
a wavelength of 

0.525 m. What wavelength would we detect on Earth? 

Strategy 


Because the galaxy is moving at a relativistic speed, we must determine the Doppler shift 
of the radio waves using the relativistic Doppler shift instead of the classical Doppler shift. 


Solution 


a. Identify the knowns: u = 0.825c; A, = 0.525 m. 
b. Identify the unknown: Apps.- 
c. Express the answer as an equation: 

Equation: 


d. Do the calculation: 


Equation: 
142 
oie = raf — 
ee 0.825c 
= 1.70mi 
Significance 


Because the galaxy is moving away from Earth, we expect the wavelengths of radiation it 
emits to be redshifted. The wavelength we calculated is 1.70 m, which is redshifted from 
the original wavelength of 0.525 m. You will see in Particle Physics and Cosmology that 
detecting redshifted radiation led to present-day understanding of the origin and evolution 


of the universe. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Suppose a space probe moves away from Earth at a 
speed 0.350c. It sends a radio-wave message back to Earth at a frequency of 1.50 


GHz. At what frequency is the message received on Earth? 


Solution: 


We can substitute the data directly into the equation for relativistic Doppler 
frequency: 


1-2 _ 0.3500 
fobs = ful Ife = (1.50 GHz) ae = 1.04 GHz. 


The relativistic Doppler effect has applications ranging from Doppler radar storm 
monitoring to providing information on the motion and distance of stars. We describe some 
of these applications in the exercises. 


Summary 


e An observer of electromagnetic radiation sees relativistic Doppler effects if the source 
of the radiation is moving relative to the observer. The wavelength of the radiation is 
longer (called a red shift) than that emitted by the source when the source moves away 
from the observer and shorter (called a blue shift) when the source moves toward the 
observer. The shifted wavelength is described by the equation: 

Equation: 


Aobs — As 


where Aops is the observed wavelength, A, is the source wavelength, and v is the 
relative velocity of the source to the observer. 


Conceptual Questions 


Exercise: 
Problem: 
Explain the meaning of the terms “red shift” and “blue shift” as they relate to the 
relativistic Doppler effect. 
Exercise: 
Problem: 


What happens to the relativistic Doppler effect when relative velocity is zero? Is this 
the expected result? 


Solution: 


There is no measured change in wavelength or frequency in this case. The relativistic 
Doppler effect depends only on the relative velocity of the source and the observer, 
not any speed relative to a medium for the light waves. 


Exercise: 
Problem: 
Is the relativistic Doppler effect consistent with the classical Doppler effect in the 
respect that Apps is larger for motion away? 

Exercise: 
Problem: 
All galaxies farther away than about 50 x 10° ly exhibit a red shift in their emitted 
light that is proportional to distance, with those farther and farther away having 


progressively greater red shifts. What does this imply, assuming that the only source 
of red shift is relative motion? 


Solution: 


It shows that the stars are getting more distant from Earth, that the universe is 
expanding, and doing so at an accelerating rate, with greater velocity for more distant 
stars. ] 


Problems 


Exercise: 


Problem: 


A highway patrol officer uses a device that measures the speed of vehicles by 
bouncing radar off them and measuring the Doppler shift. The outgoing radar has a 
frequency of 100 GHz and the returning echo has a frequency 15.0 kHz higher. What 
is the velocity of the vehicle? Note that there are two Doppler shifts in echoes. Be 
certain not to round off until the end of the problem, because the effect is small. 


Relativistic Momentum 
By the end of this section, you will be able to: 


¢ Define relativistic momentum in terms of mass and velocity 

¢ Show how relativistic momentum relates to classical momentum 

¢ Show how conservation of relativistic momentum limits objects with mass to 
speeds less than c 


Momentum is a central concept in physics. The broadest form of Newton’s second law 
is stated in terms of momentum. Momentum is conserved whenever the net external 
force on a system is zero. This makes momentum conservation a fundamental tool for 
analyzing collisions ([link]). Much of what we know about subatomic structure comes 
from the analysis of collisions of accelerator-produced relativistic particles, and 
momentum conservation plays a crucial role in this analysis. 


Momentum is an important concept for 
these football players from the University of 
California at Berkeley and the University of 
California at Davis. A player with the same 

velocity but greater mass collides with 

greater impact because his momentum is 


greater. For objects moving at relativistic 
speeds, the effect is even greater. 


The first postulate of relativity states that the laws of physics are the same in all inertial 
frames. Does the law of conservation of momentum survive this requirement at high 
velocities? It can be shown that the momentum calculated as merely p = m&, even 
if it is conserved in one frame of reference, may not be conserved in another after 
applying the Lorentz transformation to the velocities. The correct equation for 
momentum can be shown, instead, to be the classical expression in terms of the 
increment dr of proper time of the particle, observed in the particle’s rest frame: 


Equation: 


= J = ym. 


Note: 

Relativistic Momentum 

Relativistic momentum p is classical momentum multiplied by the relativistic factor 
y: 

Equation: 


p = ymu 


where m is the rest mass of the object, U is its velocity relative to an observer, and y 
is the relativistic factor: 
Equation: 


Note that we use u for velocity here to distinguish it from relative velocity v between 
observers. The factor y that occurs here has the same form as the previous relativistic 
factor y except that it is now in terms of the velocity of the particle u instead of the 
relative velocity v of two frames of reference. 


With p expressed in this way, total momentum pio is conserved whenever the net 
external force is zero, just as in classical physics. Again we see that the relativistic 
quantity becomes virtually the same as the classical quantity at low velocities, where 
u/c is small and ¥ is very nearly equal to 1. Relativistic momentum has the same 
intuitive role as classical momentum. It is greatest for large masses moving at high 
velocities, but because of the factor y, relativistic momentum approaches infinity as u 
approaches c ((link]). This is another indication that an object with mass cannot reach 
the speed of light. If it did, its momentum would become infinite—an unreasonable 
value. 
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Relativistic momentum approaches infinity as 
the velocity of an object approaches the speed 
of light. 


The relativistically correct definition of momentum as p = ymu is sometimes taken to 
imply that mass varies with velocity: Mya, = ym, particularly in older textbooks. 
However, note that m is the mass of the object as measured by a person at rest relative 
to the object. Thus, m is defined to be the rest mass, which could be measured at rest, 
perhaps using gravity. When a mass is moving relative to an observer, the only way 
that its mass can be determined is through collisions or other means involving 
momentum. Because the mass of a moving object cannot be determined independently 


of momentum, the only meaningful mass is rest mass. Therefore, when we use the 
term “mass,” assume it to be identical to “rest mass.” 


Relativistic momentum is defined in such a way that conservation of momentum holds 
in all inertial frames. Whenever the net external force on a system is zero, relativistic 
momentum is conserved, just as is the case for classical momentum. This has been 
verified in numerous experiments. 


Note: 
Exercise: 


Problem: 


Check Your Understanding What is the momentum of an electron traveling at a 
speed 0.985c? The rest mass of the electron is 9.11 x 10~*+kg. 


Solution: 


Substitute the data into the given equation: 


p= ymu = 4 = (9-11 x 10 ** kg)(0.985)(3.00 x 10° m/s) _ 4 re Y 49-21 kg-m/s. 


2 
ju 0.985¢)? 
Vv 5) j1-e 


Summary 


e The law of conservation of momentum is valid for relativistic momentum 
whenever the net external force is zero. The relativistic momentum is p = ymu, 
where m is the rest mass of the object, u is its velocity relative to an observer, and 


the relativistic factor is y = . 


e At low velocities, relativistic momentum is equivalent to classical momentum. 
¢ Relativistic momentum approaches infinity as u approaches c. This implies that 
an object with mass cannot reach the speed of light. 


Conceptual Questions 


Exercise: 


Problem: 


How does modern relativity modify the law of conservation of momentum? 
Exercise: 


Problem: 


Is it possible for an external force to be acting on a system and relativistic 
momentum to be conserved? Explain. 


Solution: 
Yes. This can happen if the external force is balanced by other externally applied 
forces, so that the net external force is zero. 

Problems 


Exercise: 


Problem: 


Find the momentum of a helium nucleus having a mass of 6.68 x 10?’ kg that 
is moving at 0.200c. 


Solution: 


4.09 x 10°’ kg- m/s 


Exercise: 


Problem: What is the momentum of an electron traveling at 0.980c? 
Exercise: 


Problem: 


(a) Find the momentum of a1.00 x 10°-kg asteroid heading towards Earth at 
30.0 km/s. (b) Find the ratio of this momentum to the classical momentum. (Hint: 
Use the approximation that ~ = 1 + (1/2)v?/c? at low velocities.) 


Solution: 


a. 3.000000015 x 10/% kg - m/s; b. 1.000000005 


Exercise: 
Problem: 
(a) What is the momentum of a 2000-kg satellite orbiting at 4.00 km/s? (b) Find 


the ratio of this momentum to the classical momentum. (Hint: Use the 
approximation that y = 1 + (1/2)v?/c? at low velocities.) 


Exercise: 


Problem: 


What is the velocity of an electron that has a momentum of 
3.04 x 10°?! kg - m/s ? Note that you must calculate the velocity to at least 
four digits to see the difference from c. 


Solution: 


2.988 x 10° m/s 
Exercise: 


Problem: 


Find the velocity of a proton that has a momentum of 4.48 x io kg - m/s. 


Glossary 


relativistic momentum 
p, the momentum of an object moving at relativistic velocity; p = ymu 


rest mass 
mass of an object as measured by an observer at rest relative to the object 


Relativistic Energy 
By the end of this section, you will be able to: 


e Explain how the work-energy theorem leads to an expression for the relativistic kinetic 
energy of an object 

e Show how the relativistic energy relates to the classical kinetic energy, and sets a limit 
on the speed of any object with mass 

e Describe how the total energy of a particle is related to its mass and velocity 

e Explain how relativity relates to energy-mass equivalence, and some of the practical 
implications of energy-mass equivalence 


The tokamak in [link] is a form of experimental fusion reactor, which can change mass to 
energy. Nuclear reactors are proof of the relationship between energy and matter. 


Conservation of energy is one of the most important laws in physics. Not only does energy 
have many important forms, but each form can be converted to any other. We know that 
classically, the total amount of energy in a system remains constant. Relativistically, energy 
is still conserved, but energy-mass equivalence must now be taken into account, for 
example, in the reactions that occur within a nuclear reactor. Relativistic energy is 
intentionally defined so that it is conserved in all inertial frames, just as is the case for 
relativistic momentum. As a consequence, several fundamental quantities are related in ways 
not known in classical physics. All of these relationships have been verified by experimental 
results and have fundamental consequences. The altered definition of energy contains some 
of the most fundamental and spectacular new insights into nature in recent history. 


The National Spherical Torus Experiment (NSTX) is a 
fusion reactor in which hydrogen isotopes undergo 
fusion to produce helium. In this process, a relatively 
small mass of fuel is converted into a large amount of 


energy. (credit: Princeton Plasma Physics Laboratory) 


Kinetic Energy and the Ultimate Speed Limit 


The first postulate of relativity states that the laws of physics are the same in all inertial 
frames. Einstein showed that the law of conservation of energy of a particle is valid 
relativistically, but for energy expressed in terms of velocity and mass in a way consistent 
with relativity. 


Consider first the relativistic expression for the kinetic energy. We again use u for velocity to 
distinguish it from relative velocity v between observers. Classically, kinetic energy is 
related to mass and speed by the familiar expression K = ~ mu’. The corresponding 
relativistic expression for kinetic energy can be obtained from the work-energy theorem. 
This theorem states that the net work on a system goes into kinetic energy. Specifically, if a 


= dp d(yu : : : * 
force, expressed as F = a =m uo , accelerates a particle from rest to its final velocity, 


the work done on the particle should be equal to its final kinetic energy. In mathematical 
form, for one-dimensional motion: 
Equation: 


d 
K = [ Fax = [ms (yu)dx 


d(yu) dzx a d u 
= m | —dt=m]lu dt. 
dt at dt f cots Je)? 


Integrate this by parts to obtain 
Equation: 


mu2 

= ap -m | —*— te dt 
1 —(u/e 

mu2 U 
wer m | Tae 
“thm 
=: mu? 

V1-(u/e)? 1—(u/c)? 

Be eves ae 


4/1—(u/c)? 


2 
K = Mo — me’. 


V1-(u/e)? 


Note: 

Relativistic Kinetic Energy 

Relativistic kinetic energy of any particle of mass m is 
Equation: 


Kye = (¥ - 1)mce’. 


When an object is motionless, its speed is u = 0 and 
Equation: 


so that Kye, = 0 at rest, as expected. But the expression for relativistic kinetic energy (such 
as total energy and rest energy) does not look much like the classical — mu”. To show that 
the expression for K,,, reduces to the classical expression for kinetic energy at low speeds, 
we use the binomial expansion to obtain an approximation for (1 + €)” valid for small e: 
Equation: 


—1 —1)(n-2 
(1+ .«)” =1tne+ MU + a aa ~ 14+ née 
by neglecting the very small terms in €? and higher powers of e. Choosing € = —u?/c? and 
= —+ leads to the conclusion that y at nonrelativistic speeds, where ¢ = w/c is small, 
satisfies 
Equation: 


= 1 fu? 
y= (1-wu?/c’) Pg i ae +(5). 
c 
A binomial expansion is a way of expressing an algebraic quantity as a sum of an infinite 


series of terms. In some cases, as in the limit of small speed here, most terms are very small. 
Thus, the expression derived here for + is not exact, but it is a very accurate approximation. 


Therefore, at low speed: 
1 (wv 
—1=—| — }. 
r1=3(3) 


Equation: 
Entering this into the expression for relativistic kinetic energy gives 


Equation: 
1 (wu? 1 
Kye = | i mc aT mu. = AK nee 
C2 2 


That is, relativistic kinetic energy becomes the same as classical kinetic energy when u<<c. 


It is even more interesting to investigate what happens to kinetic energy when the speed of 
an object approaches the speed of light. We know that yy becomes infinite as u approaches c, 
so that Kye, also becomes infinite as the velocity approaches the speed of light ((link]). The 
increase in K,,) is far larger than in K gjasg aS V approaches c. An infinite amount of work 
(and, hence, an infinite amount of energy input) is required to accelerate a mass to the speed 
of light. 


Note: 
The Speed of Light 
No object with mass can attain the speed of light. 


The speed of light is the ultimate speed limit for any particle having mass. All of this is 
consistent with the fact that velocities less than c always add to less than c. Both the 
relativistic form for kinetic energy and the ultimate speed limit being c have been confirmed 
in detail in numerous experiments. No matter how much energy is put into accelerating a 
mass, its velocity can only approach—not reach—the speed of light. 


Kinetic Energy, K (J) 
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This graph of K,. versus velocity shows how 
kinetic energy increases without bound as 
velocity approaches the speed of light. Also 
shown is Klass, the classical kinetic energy. 


Example: 

Comparing Kinetic Energy 

An electron has a velocity v = 0.990c. (a) Calculate the kinetic energy in MeV of the 
electron. (b) Compare this with the classical value for kinetic energy at this velocity. (The 
mass of an electron is 9.11 x 10-*kg.) 

Strategy 

The expression for relativistic kinetic energy is always correct, but for (a), it must be used 
because the velocity is highly relativistic (close to c). First, we calculate the relativistic 
factor y, and then use it to determine the relativistic kinetic energy. For (b), we calculate the 
classical kinetic energy (which would be close to the relativistic value if v were less than a 
few percent of c) and see that it is not the same. 

Solution for (a) 

For part (a): 


a. Identify the knowns: v = 0.990c;m = 9.11 x 10~*!kg. 
b. Identify the unknown: Kye). 


c. Express the answer as an equation: Ky.) = (y — 1)mc? with y = l 


/1—u2/c2 * 
d. Do the calculation. First calculate ~. Keep extra digits because this is an intermediate 
calculation: 
Equation: 
1 
BY, — 
i 
= 1 
c)2 
/ ee a ) 
= (20808. 


Now use this value to calculate the kinetic energy: 
Equation: 
Kya = (¥ —1)me? 
= (7.0888 — 1)(9.11 x 10~*! kg)(3.00 x 108 m/s’) 
= AOO2 2 Ne), 


e. Convert units: 
Equation: 


= —13 1 MeV 
Kye = (4.9922 x 10-13 J) (alts) 
— 3.12 MeV. 


Solution for (b) 
For part (b): 


a. List the knowns: v = 0.990c; m = 9.11 x 107*kg. 
b. List the unknown: Kye. 
c. Express the answer as an equation: K glass = 
d. Do the calculation: 

Equation: 


1 2 
37 mu. 


PS einae = = mu 
aay al —31 2 8 2 
= 1(9.11 x 10-3!kg)(0.990)?(3.00 x 10% m/s) 
= 4.0179 x 10°" J. 


e. Convert units: 


Equation: 


== —14 1 MeV 
TK ages = 4.0179 x 10 4) (=_itts;) 
= 0.251 Mev. 


Significance 

As might be expected, because the velocity is 99.0% of the speed of light, the classical 
kinetic energy differs significantly from the correct relativistic value. Note also that the 
classical value is much smaller than the relativistic value. In fact, Kye1/K class = 12.4 in 
this case. This illustrates how difficult it is to get a mass moving close to the speed of light. 
Much more energy is needed than predicted classically. Ever-increasing amounts of energy 
are needed to get the velocity of a mass a little closer to that of light. An energy of 3 MeV is 
a very small amount for an electron, and it can be achieved with present-day particle 
accelerators. SLAC, for example, can accelerate electrons to over 

50 x 10°eV = 50,000 MeV. 


Is there any point in getting v a little closer to c than 99.0% or 99.9%? The answer is yes. 
We learn a great deal by doing this. The energy that goes into a high-velocity mass can be 
converted into any other form, including into entirely new particles. In the Large Hadron 
Collider in [link], charged particles are accelerated before entering the ring-like structure. 
There, two beams of particles are accelerated to their final speed of about 99.7% the speed 
of light in opposite directions, and made to collide, producing totally new species of 
particles. Most of what we know about the substructure of matter and the collection of 
exotic short-lived particles in nature has been learned this way. Patterns in the characteristics 
of these previously unknown particles hint at a basic substructure for all matter. These 
particles and some of their characteristics will be discussed in a later chapter on particle 
physics. 


The European Organization for Nuclear 
Research (called CERN after its French 
name) operates the largest particle 
accelerator in the world, straddling the 
border between France and Switzerland. 
(credit: modification of work by NASA) 


Total Relativistic Energy 


The expression for kinetic energy can be rearranged to: 
Equation: 
mu? 
B= — =- K+ me’. 


/1—u?/c? 


Einstein argued in a separate article, also later published in 1905, that if the energy of a 
particle changes by AE, its mass changes by Am = AE/c?. Abundant experimental 
evidence since then confirms that mc? corresponds to the energy that the particle of mass m 
has when at rest. For example, when a neutral pion of mass m at rest decays into two 
photons, the photons have zero mass but are observed to have total energy corresponding to 


mc? for the pion. Similarly, when a particle of mass m decays into two or more particles 
with smaller total mass, the observed kinetic energy imparted to the products of the decay 
corresponds to the decrease in mass. Thus, E is the total relativistic energy of the particle, 
and mc? is its rest energy. 


Note: 

Total Energy 

Total energy E of a particle is 
Equation: 


E = ym 


where m is mass, c is the speed of light, y = 


ers and u is the velocity of the mass 
he UW 


c2 


relative to an observer. 


Note: 

Rest Energy 

Rest energy of an object is 
Equation: 


Ey = mc’. 


This is the correct form of Einstein’s most famous equation, which for the first time showed 
that energy is related to the mass of an object at rest. For example, if energy is stored in the 
object, its rest mass increases. This also implies that mass can be destroyed to release 
energy. The implications of these first two equations regarding relativistic energy are so 
broad that they were not completely recognized for some years after Einstein published them 
in 1905, nor was the experimental proof that they are correct widely recognized at first. 
Einstein, it should be noted, did understand and describe the meanings and implications of 
his theory. 


Example: 

Calculating Rest Energy 

Calculate the rest energy of a 1.00-g mass. 

Strategy 

One gram is a small mass—less than one-half the mass of a penny. We can multiply this 
mass, in SI units, by the speed of light squared to find the equivalent rest energy. 


Solution 


a. Identify the knowns: m = 1.00 x 10-*kg; c = 3.00 x 10°m/s. 
b. Identify the unknown: Eo. 
c. Express the answer as an equation: Ey = mc?. 
d. Do the calculation: 

Equation: 


Ey) = me? = (1.00 x 1073 kg) (3.00 x 108 m/s)” 
= F000 < 10.2 ke m/s) 


e. Convert units. Noting that 1 kg - m?/ sy =1J , we see the rest energy is: 
Equation: 


Ey = 9.00 x 1013 J. 


Significance 

This is an enormous amount of energy for a 1.00-g mass. Rest energy is large because the 
speed of light c is a large number and c? is a very large number, so that mc? is huge for any 
macroscopic mass. The 9.00 x 10’°J rest mass energy for 1.00 g is about twice the energy 
released by the Hiroshima atomic bomb and about 10,000 times the kinetic energy of a 
large aircraft carrier. 


Today, the practical applications of the conversion of mass into another form of energy, such 
as in nuclear weapons and nuclear power plants, are well known. But examples also existed 
when Einstein first proposed the correct form of relativistic energy, and he did describe 

some of them. Nuclear radiation had been discovered in the previous decade, and it had been 
a mystery as to where its energy originated. The explanation was that, in some nuclear 
processes, a small amount of mass is destroyed and energy is released and carried by nuclear 
radiation. But the amount of mass destroyed is so small that it is difficult to detect that any is 
missing. Although Einstein proposed this as the source of energy in the radioactive salts then 
being studied, it was many years before there was broad recognition that mass could be and, 
in fact, commonly is, converted to energy ({link]). 


(a) 


(a) The sun and (b) the Susquehanna Steam Electric Station both convert mass into 
energy—the sun via nuclear fusion, and the electric station via nuclear fission. (credit 
a: modification of work by NASA/SDO (AIA) ) 


Because of the relationship of rest energy to mass, we now consider mass to be a form of 
energy rather than something separate. There had not been even a hint of this prior to 
Einstein’s work. Energy-mass equivalence is now known to be the source of the sun’s 
energy, the energy of nuclear decay, and even one of the sources of energy keeping Earth’s 
interior hot. 


Stored Energy and Potential Energy 


What happens to energy stored in an object at rest, such as the energy put into a battery by 
charging it, or the energy stored in a toy gun’s compressed spring? The energy input 
becomes part of the total energy of the object and thus increases its rest mass. All stored and 
potential energy becomes mass in a system. In seeming contradiction, the principle of 
conservation of mass (meaning total mass is constant) was one of the great laws verified by 
nineteenth-century science. Why was it not noticed to be incorrect? The following example 
helps answer this question. 


Example: 

Calculating Rest Mass 

A car battery is rated to be able to move 600 ampere-hours (A - h) of charge at 12.0 V. (a) 
Calculate the increase in rest mass of such a battery when it is taken from being fully 


depleted to being fully charged, assuming none of the chemical reactants enter or leave the 
battery. (b) What percent increase is this, given that the battery’s mass is 20.0 kg? 

Strategy 

In part (a), we first must find the energy stored as chemical energy att in the battery, 
which equals the electrical energy the battery can provide. Because Eyatt = gV, we have to 
calculate the charge q in 600 A - h, which is the product of the current J and the time t. We 
then multiply the result by 12.0 V. We can then calculate the battery’s increase in mass 
using Eat; = (Am)c?. Part (b) is a simple ratio converted into a percentage. 

Solution for (a) 


a. Identify the knowns: I -¢ = 600 A- h; V = 12.0 V;c = 3.00 x 108 m/s. 
b. Identify the unknown: Am. 
c. Express the answer as an equation: 

Equation: 


Epatt — (Am) C7 


A m= E batt 


d. Do the calculation: 
Equation: 


(600 A - h)(12.0 V) 


i 5 
(3.00 x 108) 


Write amperes A as coulombs per second (C/s), and convert hours into seconds: 
Equation: 
(600 C/s -h) (28%) (12.0 J/C) 

(3.00 x 108 m/s)” 


= 2.88 x 107" kg. 


An = 


where we have used the conversion 1 kg - m?/ s?=1J. 


Solution for (b) 
For part (b): 


a. Identify the knowns: Am = 2.88 x 107'°kg; m = 20.0 kg. 
b. Identify the unknown: % change. 
c. Express the answer as an equation: % increase = Am. x 100%. 


d. Do the calculation: 


Equation: 
% increase = Am x 100% 
2.88 x 10-9 kg 
= nore 100% 
=1.44 x 10°°%. 
Significance 


Both the actual increase in mass and the percent increase are very small, because energy is 
divided by c?, a very large number. We would have to be able to measure the mass of the 
battery to a precision of a billionth of a percent, or 1 part in 101, to notice this increase. It 
is no wonder that the mass variation is not readily observed. In fact, this change in mass is 
so small that we may question how anyone could verify that it is real. The answer is found 
in nuclear processes in which the percentage of mass destroyed is large enough to be 
measured accurately. The mass of the fuel of a nuclear reactor, for example, is measurably 
smaller when its energy has been used. In that case, stored energy has been released 
(converted mostly into thermal energy to power electric generators) and the rest mass has 
decreased. A decrease in mass also occurs from using the energy stored in a battery, except 
that the stored energy is much greater in nuclear processes, making the change in mass 
measurable in practice as well as in theory. 


Relativistic Energy and Momentum 


We know classically that kinetic energy and momentum are related to each other, because: 
Equation: 


AK ise = Aue 
mm 


Relativistically, we can obtain a relationship between energy and momentum by 
algebraically manipulating their defining equations. This yields: 
Equation: 


E? = (pe)? + (me*)”, 


where E is the relativistic total energy, FE = mc?/ / 1 — u?/c?, and p is the relativistic 
momentum. This relationship between relativistic energy and relativistic momentum is more 
complicated than the classical version, but we can gain some interesting new insights by 
examining it. First, total energy is related to momentum and rest mass. At rest, momentum is 


zero, and the equation gives the total energy to be the rest energy mc? (so this equation is 
consistent with the discussion of rest energy above). However, as the mass is accelerated, its 
momentum p increases, thus increasing the total energy. At sufficiently high velocities, the 
rest energy term (me?)’ becomes negligible compared with the momentum term (pc)?; 
thus, & = pc at extremely relativistic velocities. 


If we consider momentum p to be distinct from mass, we can determine the implications of 
the equation E? = (pc)? + (me?)’, for a particle that has no mass. If we take m to be zero 
in this equation, then EF = pc, orp = E’/c. Massless particles have this momentum. There 
are several massless particles found in nature, including photons (which are packets of 
electromagnetic radiation). Another implication is that a massless particle must travel at 
speed c and only at speed c. It is beyond the scope of this text to examine the relationship in 
the equation E? = (pc)? + (me2)” in detail, but you can see that the relationship has 
important implications in special relativity. 


Note: 
Exercise: 


Problem: 


Check Your Understanding What is the kinetic energy of an electron if its speed is 
0.992c? 


Solution: 


SS Oley 10a kay (oid ms) Shien 108 ay 


0.992c)2 
ye ( s ) 


Summary 


The relativistic work-energy theorem is 

Wnet = E — Ey = ymc? — me? = (y — 1)mce?. 

¢ Relativistically, Whee = Kye, where KK, is the relativistic kinetic energy. 

e An object of mass m at velocity u has kinetic energy Kye) = (y — 1)mc?, where 


1 
>a 
ear 
¢ At low velocities, relativistic kinetic energy reduces to classical kinetic energy. 


¢ No object with mass can attain the speed of light, because an infinite amount of work 
and an infinite amount of energy input is required to accelerate a mass to the speed of 
light. 

e Relativistic energy is conserved as long as we define it to include the possibility of 
mass changing to energy. 

e The total energy of a particle with mass m traveling at speed u is defined as 
E = ymc?, where y = = and u denotes the velocity of the particle. 

2 


e The rest energy of an object of mass mis Ky = mc“, meaning that mass is a form of 
energy. If energy is stored in an object, its mass increases. Mass can be destroyed to 
release energy. 

e We do not ordinarily notice the increase or decrease in mass of an object because the 
change in mass is so small for a large increase in energy. The equation 


2 


E? = (pe)? + (me?)* relates the relativistic total energy E and the relativistic 
momentum p. At extremely high velocities, the rest energy mc? becomes negligible, 
and & = pce. 


Key Equations 


Time dilation 


of = 
Lorentz factor ie 2 
Length contraction | bean bre eee a 

Cc 
Galilean 
: 2 = cut =. 2= 2. t= 
transformation Eee as } 
Lorentz ee tual /c? 
transformation VJ1-v?/e? 
— al+vtl 
eee er 7 
y=yl 


Inverse Lorentz te tates 


transformation J 1-v?/c? 
r= x—vt 
J1—v?/e2 
yl=y 
Zl= Zz 


Space-time invariants (As)? = (Ax)* + (Ay)? + (Az)? — c? (At)? 


(Az)?+(Ay)?+(Az)? 
(Ar)* = —(As)"/e? = (At)’ | a | 
Relativistic velocity _ (ute = uy,/ 7 ul, | 
addition tae = \ Trouk fem J? Ue = \ Trowel J? 2 = \ Tree [2 
Relativistic Doppler Fear ct 1 
effect for wavelength obs ™ “As4/ qo2 
Relativistic Doppler _ ee 
effect for frequency fobs = fs 1+2 
Relativistic p= ymti = —Z8 
momentum 1-4 
Relativistic total E= yme?, where y= 1 7 
energy ei) 
Relativistic kinetic Kx = (7 = 1)me?, where Sa! 1 : 
energy 1-*% 


Conceptual Questions 


Exercise: 
Problem: 
How are the classical laws of conservation of energy and conservation of mass 
modified by modern relativity? 


Exercise: 


Problem: 


What happens to the mass of water in a pot when it cools, assuming no molecules 
escape or are added? Is this observable in practice? Explain. 


Solution: 


Because it loses thermal energy, which is the kinetic energy of the random motion of its 
constituent particles, its mass decreases by an extremely small amount, as described by 
energy-mass equivalence. 


Exercise: 
Problem: 
Consider a thought experiment. You place an expanded balloon of air on weighing 
scales outside in the early morning. The balloon stays on the scales and you are able to 


measure changes in its mass. Does the mass of the balloon change as the day 
progresses? Discuss the difficulties in carrying out this experiment. 


Exercise: 
Problem: 
The mass of the fuel in a nuclear reactor decreases by an observable amount as it puts 


out energy. Is the same true for the coal and oxygen combined in a conventional power 
plant? If so, is this observable in practice for the coal and oxygen? Explain. 


Solution: 


Yes, in principle there would be a similar effect on mass for any decrease in energy, but 
the change would be so small for the energy changes in a chemical reaction that it 
would be undetectable in practice. 


Exercise: 


Problem: 


We know that the velocity of an object with mass has an upper limit of c. Is there an 
upper limit on its momentum? Its energy? Explain. 


Exercise: 


Problem: Given the fact that light travels at c , can it have mass? Explain. 


Solution: 


Not according to special relativity. Nothing with mass can attain the speed of light. 


Exercise: 


Problem: 


If you use an Earth-based telescope to project a laser beam onto the moon, you can 
move the spot across the moon’s surface at a velocity greater than the speed of light. 
Does this violate modern relativity? (Note that light is being sent from the Earth to the 
moon, not across the surface of the moon.) 


Problems 


Exercise: 
Problem: 


What is the rest energy of an electron, given its mass is 9.11 x 10~°! kg? Give your 
answer in joules and MeV. 


Solution: 
0.512 MeV according to the number of significant figures stated. The exact value is 
closer to 0.511 MeV. 

Exercise: 


Problem: 


Find the rest energy in joules and MeV of a proton, given its mass is 1.67 x 10-7" kg. 
Exercise: 
Problem: 


If the rest energies of a proton and a neutron (the two constituents of nuclei) are 938.3 
and 939.6 MeV, respectively, what is the difference in their mass in kilograms? 


Solution: 
23% 10° kg; to two digits because the difference in rest mass energies is found to 
two digits 
Exercise: 
Problem: 
The Big Bang that began the universe is estimated to have released 10° J of energy. 


How many stars could half this energy create, assuming the average star’s mass is 
4.00 x 10° kg ? 


Exercise: 


Problem: 


A supernova explosion of a 2.00 x 10*! kg star produces 1.00 x 10% J of energy. 
(a) How many kilograms of mass are converted to energy in the explosion? (b) What is 
the ratio Am/m of mass destroyed to the original mass of the star? 


Solution: 


a.1.11 x 107’ kg; b.5.56 x 10° 
Exercise: 
Problem: 
(a) Using data from Potential Energy of a System, calculate the mass converted to 


energy by the fission of 1.00 kg of uranium. (b) What is the ratio of mass destroyed to 
the original mass, Am/m? 


Exercise: 
Problem: 
(a) Using data from Potential Energy of a System, calculate the amount of mass 
converted to energy by the fusion of 1.00 kg of hydrogen. (b) What is the ratio of mass 


destroyed to the original mass, Am /m ? (c) How does this compare with Am/m for 
the fission of 1.00 kg of uranium? 


Solution: 


a. 7.1 x 107? kg; b. 7.1 x 10-7 =7.1 x 1077;¢ — is greater for hydrogen 
Exercise: 


Problem: 


There is approximately 10*4 J of energy available from fusion of hydrogen in the 
world’s oceans. (a) If 10°° J of this energy were utilized, what would be the decrease in 
mass of the oceans? (b) How great a volume of water does this correspond to? (c) 
Comment on whether this is a significant fraction of the total mass of the oceans. 


Exercise: 
Problem: 
A muon has a rest mass energy of 105.7 MeV, and it decays into an electron and a 


massless particle. (a) If all the lost mass is converted into the electron’s kinetic energy, 
find + for the electron. (b) What is the electron’s velocity? 


Solution: 


a. 208; b. 0.999988c; six digits used to show difference from c 
Exercise: 
Problem: 
A m-meson is a particle that decays into a muon and a massless particle. The 7-meson 
has a rest mass energy of 139.6 MeV, and the muon has a rest mass energy of 105.7 


MeV. Suppose the 7r-meson is at rest and all of the missing mass goes into the muon’s 
kinetic energy. How fast will the muon move? 


Exercise: 
Problem: 
(a) Calculate the relativistic kinetic energy of a 1000-kg car moving at 30.0 m/s if the 


speed of light were only 45.0 m/s. (b) Find the ratio of the relativistic kinetic energy to 
classical. 


Solution: 


a. 6.92 x 10°J;b. 1.54 
Exercise: 
Problem: 
Alpha decay is nuclear decay in which a helium nucleus is emitted. If the helium 


nucleus has a mass of 6.80 x 1072” kg and is given 5.00 MeV of kinetic energy, what 
is its velocity? 


Exercise: 
Problem: 
(a) Beta decay is nuclear decay in which an electron is emitted. If the electron is given 
0.750 MeV of kinetic energy, what is its velocity? (b) Comment on how the high 


velocity is consistent with the kinetic energy as it compares to the rest mass energy of 
the electron. 


Solution: 
a. 0.914c; b. The rest mass energy of an electron is 0.511 MeV, so the kinetic energy is 
approximately 150% of the rest mass energy. The electron should be traveling close to 
the speed of light. 

Additional Problems 


Exercise: 


Problem: 


(a) At what relative velocity is y = 1.50? (b) At what relative velocity is y = 100? 
Exercise: 


Problem: 
(a) At what relative velocity is y = 2.00? (b) At what relative velocity is y = 10.0? 


Solution: 


a. 0.866c; b. 0.995c 
Exercise: 
Problem: 
Unreasonable Results (a) Find the value of required for the following situation. An 
earthbound observer measures 23.9 h to have passed while signals from a high-velocity 


space probe indicate that 24.0 h have passed on board. (b) What is unreasonable about 
this result? (c) Which assumptions are unreasonable or inconsistent? 


Exercise: 
Problem: 
(a) How long does it take the astronaut in [link] to travel 4.30 ly at 0.99944c (as 
measured by the earthbound observer)? (b) How long does it take according to the 


astronaut? (c) Verify that these two times are related through time dilation with 
7 = 30.00 as given. 


Solution: 


a. 4.303 y to four digits to show any effect; b. 0.1434 y; c. 1/4/(1 — v?/c?) = 29.88. 
Exercise: 

Problem: 

(a) How fast would an athlete need to be running for a 100-m race to look 100 yd long? 


(b) Is the answer consistent with the fact that relativistic effects are difficult to observe 
in ordinary circumstances? Explain. 


Exercise: 
Problem: 
(a) Find the value of + for the following situation. An astronaut measures the length of 


his spaceship to be 100 m, while an earthbound observer measures it to be 25.0 m. (b) 
What is the speed of the spaceship relative to Earth? 


Solution: 


a. 4.00; b. v = 0.867c 
Exercise: 


Problem: 


A clock in a spaceship runs one-tenth the rate at which an identical clock on Earth runs. 
What is the speed of the spaceship? 


Exercise: 


Problem: 


An astronaut has a heartbeat rate of 66 beats per minute as measured during his 
physical exam on Earth. The heartbeat rate of the astronaut is measured when he is in a 
spaceship traveling at 0.5c with respect to Earth by an observer (A) in the ship and by 
an observer (B) on Earth. (a) Describe an experimental method by which observer B on 
Earth will be able to determine the heartbeat rate of the astronaut when the astronaut is 
in the spaceship. (b) What will be the heartbeat rate(s) of the astronaut reported by 
observers A and B? 


Solution: 


a. A sends a radio pulse at each heartbeat to B, who knows their relative velocity and 
uses the time dilation formula to calculate the proper time interval between heartbeats 


from the observed signal. b. (66 beats/min) \/1 — v2/c? = 57.1 beats/min 
Exercise: 


Problem: 


A spaceship (A) is moving at speed c/2 with respect to another spaceship (B). 
Observers in A and B set their clocks so that the event at (x, y, z, t) of turning on a laser 
in spaceship B has coordinates (0, 0, 0, 0) in A and also (0, 0, 0, 0) in B. An observer at 
the origin of B turns on the laser at ¢ = O and turns it off at £ = 7 in his time. What is 
the time duration between on and off as seen by an observer in A? 


Exercise: 


Problem: 


Same two observers as in the preceding exercise, but now we look at two events 
occurring in spaceship A. A photon arrives at the origin of A at its time ¢ = 0 and 
another photon arrives at (2 = 1.00 m, 0, 0) at ¢ = 0 in the frame of ship A. (a) Find 
the coordinates and times of the two events as seen by an observer in frame B. (b) In 
which frame are the two events simultaneous and in which frame are they are not 
simultaneous? 


Solution: 


a. first photon: (0, 0,0) at ¢ = t’; second photon: 
— _=vaele? = (6/2)(1.00m)/e? _.577m _ -9 
he a = EE = 1.93 x 10s 
x — 100m _ 
(eae ae = 
b. simultaneous in A, not simultaneous in B 


zl 


Exercise: 
Problem: 
Same two observers as in the preceding exercises. A rod of length 1 m is laid out on the 


x-axis in the frame of B from origin to (2 = 1.00 m, 0, 0). What is the length of the 
rod observed by an observer in the frame of spaceship A? 


Exercise: 
Problem: 
An observer at origin of inertial frame S sees a flashbulb go off at 
z = 150km, y = 15.0 km, and z = 1.00 kmat time t = 4.5 x 1074s. At what time 


and position in the S/ system did the flash occur, if S/ is moving along shared x- 
direction with S at a velocity v = 0.6c? 


Solution: 
yo = tte (4.5 x 104s) ~(0.6c) (#23 ) 
VARHe 1-(0.6) 
= 1.88 x 1074s 
at = —Saut_ _ 150 x 10° m~(0.60) (3.00 x 10" m/s) (4.5 x 107s) 
ae 1—(0.6)? 


= 8.6 x 10*m= 86km 
y = y=15km 
2 = 2/= ke 
Exercise: 
Problem: 
An observer sees two events 1.5 x 10~°s apart at a separation of 800 m. How fast 


must a second observer be moving relative to the first to see the two events occur 
simultaneously? 


Exercise: 


Problem: 


An observer standing by the railroad tracks sees two bolts of lightning strike the ends 
of a 500-m-long train simultaneously at the instant the middle of the train passes him at 
50 m/s. Use the Lorentz transformation to find the time between the lightning strikes as 
measured by a passenger seated in the middle of the train. 


Solution: 
Ati+vAa/c? 
Bi ae 
0 = At!+v(500 m)/c? . 
Ji-v/e ? 


since v < c, we can ignore the term v?/c? and find 


bur — aati = 2.78 3 10° 


The breakdown of Newtonian simultaneity is negligibly small, but not exactly zero, at 
realistic train speeds of 50 m/s. 


Exercise: 


Problem: 


Two astronomical events are observed from Earth to occur at a time of 1s apart anda 
distance separation of 1.5 x 10°m from each other. (a) Determine whether separation 
of the two events is space like or time like. (b) State what this implies about whether it 
is consistent with special relativity for one event to have caused the other? 


Exercise: 


Problem: 


Two astronomical events are observed from Earth to occur at a time of 0.30 s apart and 
a distance separation of 2.0 x 10°m from each other. How fast must a spacecraft 
travel from the site of one event toward the other to make the events occur at the same 
time when measured in the frame of reference of the spacecraft? 


Solution: 
— At-vAz/c? 
Atl = Jaye 
_&) (2.0 x 109 m) 
i= ot) (3.00 x 108 m/s)” 
a J1-v?/e2 
0.30s 2 
U (eta (3.00 x 108 m/s) 


vy = 1.35 x 10’m/s 


Exercise: 


Problem: 


A spacecraft starts from being at rest at the origin and accelerates at a constant rate g, 
as seen from Earth, taken to be an inertial frame, until it reaches a speed of c/2. (a) 
Show that the increment of proper time is related to the elapsed time in Earth’s frame 
by: 

Equation: 


drt = 1 —v?/c?dt. 


(b) Find an expression for the elapsed time to reach speed c/2 as seen in Earth’s frame. 
(c) Use the relationship in (a) to obtain a similar expression for the elapsed proper time 
to reach c/2 as seen in the spacecraft, and determine the ratio of the time seen from 
Earth with that on the spacecraft to reach the final speed. 


Exercise: 
Problem: 
(a) All but the closest galaxies are receding from our own Milky Way Galaxy. If a 
galaxy 12.0 x 10° ly away is receding from us at 0.900c, at what velocity relative to 
us must we send an exploratory probe to approach the other galaxy at 0.990c as 
measured from that galaxy? (b) How long will it take the probe to reach the other 
galaxy as measured from Earth? You may assume that the velocity of the other galaxy 


remains constant. (c) How long will it then take for a radio signal to be beamed back? 
(All of this is possible in principle, but not practical.) 


Solution: 

Note that all answers to this problem are reported to five significant figures, to 

distinguish the results. a. 0.99947c; b.1.2064 x 101! y;c.1.2058 x 10! y 
Exercise: 

Problem: 

Suppose a spaceship heading straight toward the Earth at 0.750c can shoot a canister at 


0.500c relative to the ship. (a) What is the velocity of the canister relative to Earth, if it 
is shot directly at Earth? (b) If it is shot directly away from Earth? 


Exercise: 


Problem: 


Repeat the preceding problem with the ship heading directly away from Earth. 


Solution: 


a. —0.400c; b. -0.909c 
Exercise: 
Problem: 
If a spaceship is approaching the Earth at 0.100c and a message capsule is sent toward 
it at 0.100c relative to Earth, what is the speed of the capsule relative to the ship? 
Exercise: 
Problem: 
(a) Suppose the speed of light were only 3000 m/s. A jet fighter moving toward a target 
on the ground at 800 m/s shoots bullets, each having a muzzle velocity of 1000 m/s. 


What are the bullets’ velocity relative to the target? (b) If the speed of light was this 
small, would you observe relativistic effects in everyday life? Discuss. 


Solution: 


a. 1.65 km/s; b. Yes, if the speed of light were this small, speeds that we can achieve in 
everyday life would be larger than 1% of the speed of light and we could observe 
relativistic effects much more often. 


Exercise: 
Problem: 
If a galaxy moving away from the Earth has a speed of 1000 km/s and emits 656 nm 
light characteristic of hydrogen (the most common element in the universe). (a) What 


wavelength would we observe on Earth? (b) What type of electromagnetic radiation is 
this? (c) Why is the speed of Earth in its orbit negligible here? 


Exercise: 


Problem: 


A space probe speeding towards the nearest star moves at 0.250c and sends radio 
information at a broadcast frequency of 1.00 GHz. What frequency is received on 
Earth? 


Solution: 


775 MHz 


Exercise: 


Problem: 


Near the center of our galaxy, hydrogen gas is moving directly away from us in its orbit 
about a black hole. We receive 1900 nm electromagnetic radiation and know that it was 
1875 nm when emitted by the hydrogen gas. What is the speed of the gas? 


Exercise: 
Problem: 
(a) Calculate the speed of a 1.00-1g particle of dust that has the same momentum as a 


proton moving at 0.999c. (b) What does the small speed tell us about the mass of a 
proton compared to even a tiny amount of macroscopic matter? 


Solution: 
do 110410. © m/s; b. The small speed tells us that the mass of a protein is 
substantially smaller than that of even a tiny amount of macroscopic matter. 
Exercise: 
Problem: 
(a) Calculate y for a proton that has a momentum of 1.00 kg - m/s. (b) What is its 
speed? Such protons form a rare component of cosmic radiation with uncertain origins. 
Exercise: 


Problem: 


Show that the relativistic form of Newton’s second law is (a) F = m2 1 


dt 32 
(1-u?/c?) 

(b) Find the force needed to accelerate a mass of 1 kg by 1 m/ s* when it is traveling at 

a velocity of c/2. 


Solution: 
a. 

_ dp _d mu 
P= $= a( eee) 


— du m — 1 _omv?’ 9d. 
~~ dt jw /2 2 (1—u2/c2)3/? dt? 
—_ ms du 
(1—u2/c2)3/? dt 


o. 


= m du 
io (1—u2/c2)3/” dt 


may m/s”) 


= 1.53N 


Exercise: 


Problem: 


A positron is an antimatter version of the electron, having exactly the same mass. When 
a positron and an electron meet, they annihilate, converting all of their mass into 
energy. (a) Find the energy released, assuming negligible kinetic energy before the 
annihilation. (b) If this energy is given to a proton in the form of kinetic energy, what is 
its velocity? (c) If this energy is given to another electron in the form of kinetic energy, 
what is its velocity? 


Exercise: 
Problem: 
What is the kinetic energy in MeV of a m-meson that lives 1.40 x 107° s as measured 


in the laboratory, and 0.840 x 10~'°s when at rest relative to an observer, given that 
its rest energy is 135 MeV? 


Solution: 


90.0 MeV 
Exercise: 
Problem: 
Find the kinetic energy in MeV of a neutron with a measured life span of 2065 s, given 
its rest energy is 939.6 MeV, and rest life span is 900s. 
Exercise: 


Problem: 


(a) Show that (pc)? / (me?) = y? — 1. This means that at large velocities 
pe >> mc’. (b) Is E © pe when y = 30.0, as for the astronaut discussed in the twin 
paradox? 


Solution: 


a. y? — 1;b. yes 


Exercise: 


Problem: 


One cosmic ray neutron has a velocity of 0.250c relative to the Earth. (a) What is the 
neutron’s total energy in MeV? (b) Find its momentum. (c) Is F ~ pc in this situation? 
Discuss in terms of the equation given in part (a) of the previous problem. 


Exercise: 
Problem: 


What is y for a proton having a mass energy of 938.3 MeV accelerated through an 
effective potential of 1.0 TV (teravolt)? 


Solution: 


1,07 x 10° 
Exercise: 
Problem: 
(a) What is the effective accelerating potential for electrons at the Stanford Linear 


Accelerator, if y = 1.00 x 10° for them? (b) What is their total energy (nearly the 
same as kinetic in this case) in GeV? 


Exercise: 
Problem: 
(a) Using data from Potential Energy of a System, find the mass destroyed when the 


energy in a barrel of crude oil is released. (b) Given these barrels contain 200 liters and 


assuming the density of crude oil is 750kg/ m’, what is the ratio of mass destroyed to 
original mass, Am/m? 


Solution: 


a.6.56 x 10° *kg; b. m = (200L) (1 m*/1000 L) (750 kg/m’) = 150 kg; 
therefore, —- = 4.37 x 19710 

Exercise: 
Problem: 
(a) Calculate the energy released by the destruction of 1.00 kg of mass. (b) How many 
kilograms could be lifted to a 10.0 km height by this amount of energy? 


Exercise: 


Problem: 


A Van de Graaff accelerator utilizes a 50.0 MV potential difference to accelerate 
charged particles such as protons. (a) What is the velocity of a proton accelerated by 
such a potential? (b) An electron? 


Solution: 


a. 0.314c; b. 0.99995c (Five digits used to show difference from c) 
Exercise: 
Problem: 
Suppose you use an average of 500 kW - h of electric energy per month in your home. 
(a) How long would 1.00 g of mass converted to electric energy with an efficiency of 


38.0% last you? (b) How many homes could be supplied at the 500 kW - h per month 
rate for one year by the energy from the described mass conversion? 


Exercise: 
Problem: 
(a) A nuclear power plant converts energy from nuclear fission into electricity with an 
efficiency of 35.0%. How much mass is destroyed in one year to produce a continuous 


1000 MW of electric power? (b) Do you think it would be possible to observe this mass 
loss if the total mass of the fuel is 104 kg? 


Solution: 
a. 1.00 kg; b. This much mass would be measurable, but probably not observable just 
by looking because it is 0.01% of the total mass. 

Exercise: 
Problem: 
Nuclear-powered rockets were researched for some years before safety concerns 
became paramount. (a) What fraction of a rocket’s mass would have to be destroyed to 
get it into a low Earth orbit, neglecting the decrease in gravity? (Assume an orbital 
altitude of 250 km, and calculate both the kinetic energy (classical) and the 


gravitational potential energy needed.) (b) If the ship has a mass of 1.00 x 10°kg 
(100 tons), what total yield nuclear explosion in tons of TNT is needed? 


Exercise: 


Problem: 


The sun produces energy at a rate of 3.85 x 107° W by the fusion of hydrogen. About 
0.7% of each kilogram of hydrogen goes into the energy generated by the Sun. (a) How 
many kilograms of hydrogen undergo fusion each second? (b) If the sun is 90.0% 
hydrogen and half of this can undergo fusion before the sun changes character, how 
long could it produce energy at its current rate? (c) How many kilograms of mass is the 
sun losing per second? (d) What fraction of its mass will it have lost in the time found 
in part (b)? 


Solution: 


a. 6.06 x 10! kg/s;b. 4.67 x 10° y;c. 4.27 x 10°kg; d. 0.32% 
Exercise: 


Problem: 


Show that E? — p?c? for a particle is invariant under Lorentz transformations. 


Glossary 


relativistic kinetic energy 
kinetic energy of an object moving at relativistic speeds 


rest energy 


energy stored in an object at rest: Hy = mc? 


speed of light 
ultimate speed limit for any particle having mass 


total energy 
sum of all energies for a particle, including rest energy and kinetic energy, given for a 
particle of mass m and speed u by E = ymc?, where y = 


2 
4 eae 
c2 


Introduction 
class="introduction' 


In this 
image of 
pollen taken 
with an 
electron 
microscope, 
the bean- 
shaped 
grains are 
about 50um 
long. 
Electron 
microscopes 
can have a 
much higher 
resolving 
power than 
a 
conventiona 
| light 
microscope 
because 
electron 
wavelengths 
can be 
100,000 
times 
shorter than 
the 
wavelengths 
of visible- 
light 
photons. 
(credit: 


modification 
of work by 
Dartmouth 
College 
Electron 
Microscope 
Facility) 


Two of the most revolutionary concepts of the twentieth century were the 
description of light as a collection of particles, and the treatment of particles 
as waves. These wave properties of matter have led to the discovery of 
technologies such as electron microscopy, which allows us to examine 
submicroscopic objects such as grains of pollen, as shown above. 


In this chapter, you will learn about the energy quantum, a concept that was 
introduced in 1900 by the German physicist Max Planck to explain 
blackbody radiation. We discuss how Albert Einstein extended Planck’s 
concept to a quantum of light (a “photon”) to explain the photoelectric 
effect. We also show how American physicist Arthur H. Compton used the 
photon concept in 1923 to explain wavelength shifts observed in X-rays. 
After a discussion of Bohr’s model of hydrogen, we describe how matter 
waves were postulated in 1924 by Louis-Victor de Broglie to justify Bohr’s 
model and we examine the experiments conducted in 1923-1927 by Clinton 


Davisson and Lester Germer that confirmed the existence of de Broglie’s 
matter waves. 


Blackbody Radiation 
By the end of this section you will be able to: 


e Apply Wien’s and Stefan’s laws to analyze radiation emitted by a blackbody 
e Explain Planck’s hypothesis of energy quanta 


All bodies emit electromagnetic radiation over a range of wavelengths. In an earlier chapter, we learned that 
a cooler body radiates less energy than a warmer body. We also know by observation that when a body is 
heated and its temperature rises, the perceived wavelength of its emitted radiation changes from infrared to 
red, and then from red to orange, and so forth. As its temperature rises, the body glows with the colors 
corresponding to ever-smaller wavelengths of the electromagnetic spectrum. This is the underlying principle 
of the incandescent light bulb: A hot metal filament glows red, and when heating continues, its glow 
eventually covers the entire visible portion of the electromagnetic spectrum. The temperature (T) of the 
object that emits radiation, or the emitter, determines the wavelength at which the radiated energy is at its 
maximum. For example, the Sun, whose surface temperature is in the range between 5000 K and 6000 K, 
radiates most strongly in a range of wavelengths about 560 nm in the visible part of the electromagnetic 
spectrum. Your body, when at its normal temperature of about 300 K, radiates most strongly in the infrared 
part of the spectrum. 


Radiation that is incident on an object is partially absorbed and partially reflected. At thermodynamic 
equilibrium, the rate at which an object absorbs radiation is the same as the rate at which it emits it. 
Therefore, a good absorber of radiation (any object that absorbs radiation) is also a good emitter. A perfect 
absorber absorbs all electromagnetic radiation incident on it; such an object is called a blackbody. 


Although the blackbody is an idealization, because no physical object absorbs 100% of incident radiation, 
we can construct a close realization of a blackbody in the form of a small hole in the wall of a sealed 
enclosure known as a cavity radiator, as shown in [link]. The inside walls of a cavity radiator are rough and 
blackened so that any radiation that enters through a tiny hole in the cavity wall becomes trapped inside the 
cavity. At thermodynamic equilibrium (at temperature T), the cavity walls absorb exactly as much radiation 
as they emit. Furthermore, inside the cavity, the radiation entering the hole is balanced by the radiation 
leaving it. The emission spectrum of a blackbody can be obtained by analyzing the light radiating from the 
hole. Electromagnetic waves emitted by a blackbody are called blackbody radiation. 


A blackbody is physically realized by a small hole in 
the wall of a cavity radiator. 


The intensity I(\, T) of blackbody radiation depends on the wavelength of the emitted radiation and on 
the temperature T of the blackbody ([link]). The function (A, T’) is the power intensity that is radiated per 
unit wavelength; in other words, it is the power radiated per unit area of the hole in a cavity radiator per unit 
wavelength. According to this definition, [(A, T’)dA is the power per unit area that is emitted in the 
wavelength interval from A to A + dX. The intensity distribution among wavelengths of radiation emitted by 
cavities was studied experimentally at the end of the nineteenth century. Generally, radiation emitted by 
materials only approximately follows the blackbody radiation curve ([link]); however, spectra of common 
stars do follow the blackbody radiation curve very closely. 
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The intensity of blackbody radiation versus the wavelength of the emitted radiation. Each curve 
corresponds to a different blackbody temperature, starting with a low temperature (the lowest curve) to 
a high temperature (the highest curve). 
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The spectrum of radiation emitted from a quartz 
surface (blue curve) and the blackbody radiation 
curve (black curve) at 600 K. 


Two important laws summarize the experimental findings of blackbody radiation: Wien’s displacement law 
and Stefan’s law. Wien’s displacement law is illustrated in [link] by the curve connecting the maxima on the 
intensity curves. In these curves, we see that the hotter the body, the shorter the wavelength corresponding to 
the emission peak in the radiation curve. Quantitatively, Wien’s law reads 


Note: 
Equation: 


Vmax’ = 2.898 x 10-*m-K 


where Ajax is the position of the maximum in the radiation curve. In other words, Amax is the wavelength at 
which a blackbody radiates most strongly at a given temperature T. Note that in [link], the temperature is in 
kelvins. Wien’s displacement law allows us to estimate the temperatures of distant stars by measuring the 
wavelength of radiation they emit. 


Example: 

Temperatures of Distant Stars 

On a clear evening during the winter months, if you happen to be in the Northern Hemisphere and look up 
at the sky, you can see the constellation Orion (The Hunter). One star in this constellation, Rigel, flickers in 


a blue color and another star, Betelgeuse, has a reddish color, as shown in [link]. Which of these two stars is 
cooler, Betelgeuse or Rigel? 
Strategy 


We treat each star as a blackbody. Then according to Wien’s law, its temperature is inversely proportional to 


the wavelength of its peak intensity. The wavelength Oe 


pees, of red light. Even if we do not know the precise wavelengths, we can still set up a proportion. 
Solution 
Writing Wien’s law for the blue star and for the red star, we have 


Equation: 


of blue light is shorter than the wavelength 


Nace Vices) = 2-898 x 10 *mieK = Ane Taig 


When simplified, [link] gives 
Equation: 
(blue) 
max 
Led) = eed) (blue) < Tite) 
Amax 


Therefore, Betelgeuse is cooler than Rigel. 

Significance 

Note that Wien’s displacement law tells us that the higher the temperature of an emitting body, the shorter 
the wavelength of the radiation it emits. The qualitative analysis presented in this example is generally valid 
for any emitting body, whether it is a big object such as a star or a small object such as the glowing filament 
in an incandescent lightbulb. 


Note: 
Exercise: 


Problem: 


Check Your Understanding The flame of a peach-scented candle has a yellowish color and the flame 
of a Bunsen’s burner in a chemistry lab has a bluish color. Which flame has a higher temperature? 


Solution: 


Bunsen’s burner 
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In the Orion constellation, the red star Betelgeuse, which usually takes on a yellowish tint, appears as 
the figure’s right shoulder (in the upper left). The giant blue star on the bottom right is Rigel, which 
appears as the hunter’s left foot. (credit left: modification of work by Matthew Spinelli, NASA APOD) 


The second experimental relation is Stefan’s law, which concerns the total power of blackbody radiation 
emitted across the entire spectrum of wavelengths at a given temperature. In [link], this total power is 
represented by the area under the blackbody radiation curve for a given T. As the temperature of a blackbody 
increases, the total emitted power also increases. Quantitatively, Stefan’s law expresses this relation as 


Note: 
Equation: 


PD aAg 


where A is the surface area of a blackbody, T is its temperature (in kelvins), and o is the Stefan—Boltzmann 


constant, ¢ = 5.670 x 10-°W/(m? - K*). Stefan’s law enables us to estimate how much energy a star is 
radiating by remotely measuring its temperature. 


Example: 

Power Radiated by Stars 

A star such as our Sun will eventually evolve to a “red giant” star and then to a “white dwarf” star. A typical 
white dwarf is approximately the size of Earth, and its surface temperature is about 2.5 x 10*K. A typical 
red giant has a surface temperature of 3.0 x 10°K anda radius ~100,000 times larger than that of a white 


dwarf. What is the average radiated power per unit area and the total power radiated by each of these types 
of stars? How do they compare? 

Strategy 

If we treat the star as a blackbody, then according to Stefan’s law, the total power that the star radiates is 
proportional to the fourth power of its temperature. To find the power radiated per unit area of the surface, 
we do not need to make any assumptions about the shape of the star because P/A depends only on 
temperature. However, to compute the total power, we need to make an assumption that the energy radiates 
through a spherical surface enclosing the star, so that the surface area is A = 47.R?, where R is its radius. 
Solution 

A simple proportion based on Stefan’s law gives 

Equation: 


4 
Paver) Adaark =e alle = (pe) = (= x <) — 4820 


pene) lgiane 7 road Be 3.0 x 10° 


giant Tyiant 


The power emitted per unit area by a white dwarf is about 5000 times that the power emitted by a red giant. 
Denoting this ratio by a = 4.8 x 10%, [link] gives 
Equation: 


P. War»! A War 4nR? R War: . R war: 7. 
SS a go ag =o( =) =4.8 x 108 (eee a ) =4.8 x 107 
P. giant Agiant ATR? ant Reiant 10° Rawart 
We see that the total power emitted by a white dwarf is a tiny fraction of the total power emitted by a red 
giant. Despite its relatively lower temperature, the overall power radiated by a red giant far exceeds that of 
the white dwarf because the red giant has a much larger surface area. To estimate the absolute value of the 
emitted power per unit area, we again use Stefan’s law. For the white dwarf, we obtain 
Equation: 
aera W 
“dwarf _ oT. = 5.670 x 10-8 (2.5 x 104K)* =2.2 x 10!°W/m? 
Agere m? - K4 


The analogous result for the red giant is obtained by scaling the result for a white dwarf: 
Equation: 


Peiant a Bots XX 101° W 
Agiant 4.82 x 10° m? 


= 4.56 x 10° ~4.6 x 19° 
m2 m2 


Significance 

To estimate the total power emitted by a white dwarf, in principle, we could use [link]. However, to find its 
surface area, we need to know the average radius, which is not given in this example. Therefore, the 
solution stops here. The same is also true for the red giant star. 


Note: 
Exercise: 


Problem: 
Check Your Understanding An iron poker is being heated. As its temperature rises, the poker begins 


to glow—first dull red, then bright red, then orange, and then yellow. Use either the blackbody 
radiation curve or Wien’s law to explain these changes in the color of the glow. 


Solution: 


The wavelength of the radiation maximum decreases with increasing temperature. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Suppose that two stars, a and {, radiate exactly the same total power. If 
the radius of star a is three times that of star 6, what is the ratio of the surface temperatures of these 
stars? Which one is hotter? 


Solution: 


Tx /Tg = 1/ V3 & 0.58, so the star f is hotter. 


The term “blackbody” was coined by Gustav R. Kirchhoff in 1862. The blackbody radiation curve was 
known experimentally, but its shape eluded physical explanation until the year 1900. The physical model of 
a blackbody at temperature T is that of the electromagnetic waves enclosed in a cavity (see [link]) and at 
thermodynamic equilibrium with the cavity walls. The waves can exchange energy with the walls. The 
objective here is to find the energy density distribution among various modes of vibration at various 
wavelengths (or frequencies). In other words, we want to know how much energy is carried by a single 
wavelength or a band of wavelengths. Once we know the energy distribution, we can use standard statistical 
methods (similar to those studied in a previous chapter) to obtain the blackbody radiation curve, Stefan’s 
law, and Wien’s displacement law. When the physical model is correct, the theoretical predictions should be 
the same as the experimental curves. 


In a classical approach to the blackbody radiation problem, in which radiation is treated as waves (as you 
have studied in previous chapters), the modes of electromagnetic waves trapped in the cavity are in 
equilibrium and continually exchange their energies with the cavity walls. There is no physical reason why a 
wave should do otherwise: Any amount of energy can be exchanged, either by being transferred from the 
wave to the material in the wall or by being received by the wave from the material in the wall. This classical 
picture is the basis of the model developed by Lord Rayleigh and, independently, by Sir James Jeans. The 
result of this classical model for blackbody radiation curves is known as the Rayleigh—Jeans law. However, 
as shown in [link], the Rayleigh—Jeans law fails to correctly reproduce experimental results. In the limit of 
short wavelengths, the Rayleigh—Jeans law predicts infinite radiation intensity, which is inconsistent with the 
experimental results in which radiation intensity has finite values in the ultraviolet region of the spectrum. 
This divergence between the results of classical theory and experiments, which came to be called the 
ultraviolet catastrophe, shows how classical physics fails to explain the mechanism of blackbody radiation. 


Experimental 
data 


Pal Rayleigh-Jeans 


Radiation Intensity 


0123 45 67 8 9 10 
Wavelength A (4m) 


The ultraviolet catastrophe: The 
Rayleigh—Jeans law does not 
explain the observed blackbody 
emission spectrum. 


The blackbody radiation problem was solved in 1900 by Max Planck. Planck used the same idea as the 
Rayleigh—Jeans model in the sense that he treated the electromagnetic waves between the walls inside the 
cavity classically, and assumed that the radiation is in equilibrium with the cavity walls. The innovative idea 
that Planck introduced in his model is the assumption that the cavity radiation originates from atomic 
oscillations inside the cavity walls, and that these oscillations can have only discrete values of energy. 
Therefore, the radiation trapped inside the cavity walls can exchange energy with the walls only in discrete 
amounts. Planck’s hypothesis of discrete energy values, which he called quanta, assumes that the oscillators 
inside the cavity walls have quantized energies. This was a brand new idea that went beyond the classical 
physics of the nineteenth century because, as you learned in a previous chapter, in the classical picture, the 
energy of an oscillator can take on any continuous value. Planck assumed that the energy of an oscillator (FE, 
) can have only discrete, or quantized, values: 


Note: 
Equation: 


EF, =nhf, wheren = 1,2,3,... 


In [link], f is the frequency of Planck’s oscillator. The natural number n that enumerates these discrete 
energies is called a quantum number. The physical constant h is called Planck’s constant: 


Note: 
Equation: 


h= 6626 10> Jos= 4136 x 10s eV «s 


Each discrete energy value corresponds to a quantum state of a Planck oscillator. Quantum states are 
enumerated by quantum numbers. For example, when Planck’s oscillator is in its first n = 1 quantum state, 
its energy is EF, = hf; when it is in the nm = 2 quantum state, its energy is Ey = 2hf; when it is in the 

n = 3 quantum state, #3 = 3hf; and so on. 


Note that [link] shows that there are infinitely many quantum states, which can be represented as a sequence 
{hf, 2hf, 3hf,..., (a — hf, nhf, (n+ 1)hf,...}. Each two consecutive quantum states in this sequence are 
separated by an energy jump, AF = Af. An oscillator in the wall can receive energy from the radiation in 
the cavity (absorption), or it can give away energy to the radiation in the cavity (emission). The absorption 
process sends the oscillator to a higher quantum state, and the emission process sends the oscillator to a 
lower quantum state. Whichever way this exchange of energy goes, the smallest amount of energy that can 
be exchanged is hf. There is no upper limit to how much energy can be exchanged, but whatever is 
exchanged must be an integer multiple of hf. If the energy packet does not have this exact amount, it is 
neither absorbed nor emitted at the wall of the blackbody. 


Note: 

Planck’s Quantum Hypothesis 

Planck’s hypothesis of energy quanta states that the amount of energy emitted by the oscillator is carried 
by the quantum of radiation, AE : 

Equation: 


AE =hf 


Recall that the frequency of electromagnetic radiation is related to its wavelength and to the speed of light by 
the fundamental relation fA = c. This means that we can express [link] equivalently in terms of wavelength 
A. When included in the computation of the energy density of a blackbody, Planck’s hypothesis gives the 
following theoretical expression for the power intensity of emitted radiation per unit wavelength: 


Note: 
Equation: 


2thc? 1 
HOT) a ehe/AkeT _ J 


where c is the speed of light in vacuum and kg is Boltzmann’s constant, kg = 1.380 x 10° *?J /K. The 
theoretical formula expressed in [link] is called Planck’s blackbody radiation law. This law is in agreement 
with the experimental blackbody radiation curve (see [link]). In addition, Wien’s displacement law and 
Stefan’s law can both be derived from [link]. To derive Wien’s displacement law, we use differential calculus 
to find the maximum of the radiation intensity curve I(A, T’). To derive Stefan’s law and find the value of 
the Stefan—Boltzmann constant, we use integral calculus and integrate I(A, T’) to find the total power 
radiated by a blackbody at one temperature in the entire spectrum of wavelengths from A = 0 to AX = oo. 
This derivation is left as an exercise later in this chapter. 
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Planck’s theoretical result 
(continuous curve) and the 
experimental blackbody radiation 
curve (dots). 


Example: 

Planck’s Quantum Oscillator 

A quantum oscillator in the cavity wall in [link] is vibrating at a frequency of 5.0 x 10'*Hz. Calculate the 
spacing between its energy levels. 

Strategy 

Energy states of a quantum oscillator are given by [link]. The energy spacing AF is obtained by finding the 
energy difference between two adjacent quantum states for quantum numbers n + 1 and n. 

Solution 

We can substitute the given frequency and Planck’s constant directly into the equation: 

Equation: 


AR = F44 — FE, = (n+ 1)hf —nhf = hf = (6.626 x 10 “J -s)(5.0 x 10°Hz) =3.3 x 10° Py 
Significance 


Note that we do not specify what kind of material was used to build the cavity. Here, a quantum oscillator is 
a theoretical model of an atom or molecule of material in the wall. 


Note: 
Exercise: 


Problem: 


Check Your Understanding A molecule is vibrating at a frequency of 5.0 x 10'Hz. What is the 
smallest spacing between its vibrational energy levels? 


Solution: 


2300. a 


Example: 

Quantum Theory Applied to a Classical Oscillator 

A 1.0-kg mass oscillates at the end of a spring with a spring constant of 1000 N/m. The amplitude of these 
oscillations is 0.10 m. Use the concept of quantization to find the energy spacing for this classical oscillator. 
Is the energy quantization significant for macroscopic systems, such as this oscillator? 

Strategy 

We use [link] as though the system were a quantum oscillator, but with the frequency f of the mass vibrating 
on a spring. To evaluate whether or not quantization has a significant effect, we compare the quantum 
energy spacing with the macroscopic total energy of this classical oscillator. 

Solution 

For the spring constant, k = 1.0 x 10°N/m, the frequency f of the mass, m = 1.0kg, is 


Equation: 
1 eR 0x 10°N 
i / ae He eee 
2n Vm 20 1.0kg 


The energy quantum that corresponds to this frequency is 
Equation: 


AE = hf = (6.626 x 10-“J-s)(5.0Hz) = 3.3 x 10°*J 


When vibrations have amplitude A = 0.10m, the energy of oscillations is 
Equation: 


1 1 
E= zh’ = 5 (L000 /m) (0.1m)° = 5.0J 


Significance 

Thus, for a classical oscillator, we have AE / E = 10~*4. We see that the separation of the energy levels is 
immeasurably small. Therefore, for all practical purposes, the energy of a classical oscillator takes on 
continuous values. This is why classical principles may be applied to macroscopic systems encountered in 
everyday life without loss of accuracy. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Would the result in [link] be different if the mass were not 1.0 kg but a 
tiny mass of 1.0 yg, and the amplitude of vibrations were 0.10 um? 


Solution: 


No, because then AE'/ E = 10771 


When Planck first published his result, the hypothesis of energy quanta was not taken seriously by the 
physics community because it did not follow from any established physics theory at that time. It was 
perceived, even by Planck himself, as a useful mathematical trick that led to a good theoretical “fit” to the 
experimental curve. This perception was changed in 1905 when Einstein published his explanation of the 
photoelectric effect, in which he gave Planck’s energy quantum a new meaning: that of a particle of light. 


Summary 


e All bodies radiate energy. The amount of radiation a body emits depends on its temperature. The 
experimental Wien’s displacement law states that the hotter the body, the shorter the wavelength 
corresponding to the emission peak in the radiation curve. The experimental Stefan’s law states that the 
total power of radiation emitted across the entire spectrum of wavelengths at a given temperature is 
proportional to the fourth power of the Kelvin temperature of the radiating body. 

e Absorption and emission of radiation are studied within the model of a blackbody. In the classical 
approach, the exchange of energy between radiation and cavity walls is continuous. The classical 
approach does not explain the blackbody radiation curve. 

e To explain the blackbody radiation curve, Planck assumed that the exchange of energy between 
radiation and cavity walls takes place only in discrete quanta of energy. Planck’s hypothesis of energy 
quanta led to the theoretical Planck’s radiation law, which agrees with the experimental blackbody 
radiation curve; it also explains Wien’s and Stefan’s laws. 


Conceptual Questions 


Exercise: 


Problem: Which surface has a higher temperature — the surface of a yellow star or that of a red star? 


Solution: 


yellow 
Exercise: 


Problem: 


Describe what you would see when looking at a body whose temperature is increased from 1000 K to 
1,000,000 K. 


Exercise: 


Problem: Explain the color changes in a hot body as its temperature is increased. 


Solution: 
goes from red to violet through the rainbow of colors 


Exercise: 


Problem: Speculate as to why UV light causes sunburn, whereas visible light does not. 


Exercise: 


Problem: 


Two cavity radiators are constructed with walls made of different metals. At the same temperature, how 
would their radiation spectra differ? 


Solution: 


would not differ 
Exercise: 


Problem: 


Discuss why some bodies appear black, other bodies appear red, and still other bodies appear white. 
Exercise: 
Problem: 


If everything radiates electromagnetic energy, why can we not see objects at room temperature in a dark 
room? 


Solution: 


human eye does not see IR radiation 
Exercise: 


Problem: 


How much does the power radiated by a blackbody increase when its temperature (in K) is tripled? 


Problems 


Exercise: 
Problem: 
A 200-W heater emits a 1.5-j1m radiation. (a) What value of the energy quantum does it emit? (b) 
Assuming that the specific heat of a 4.0-kg body is 0.83kcal /kg - K, how many of these photons must 


be absorbed by the body to increase its temperature by 2 K? (c) How long does the heating process in 
(b) take, assuming that all radiation emitted by the heater gets absorbed by the body? 


Solution: 


a. 0.81 eV; b. 2.1 x 10?3; c. 2 min 20 sec 

Exercise: 
Problem: 
A 900-W microwave generator in an oven generates energy quanta of frequency 2560 MHz. (a) How 
many energy quanta does it emit per second? (b) How many energy quanta must be absorbed by a pasta 
dish placed in the radiation cavity to increase its temperature by 45.0 K? Assume that the dish has a 


mass of 0.5 kg and that its specific heat is 0.9 kcal /kg - K. (c) Assume that all energy quanta emitted 
by the generator are absorbed by the pasta dish. How long must we wait until the dish in (b) is ready? 


Exercise: 


Problem: 


(a) For what temperature is the peak of blackbody radiation spectrum at 400 nm? (b) If the temperature 
of a blackbody is 800 K, at what wavelength does it radiate the most energy? 


Solution: 


a. 7245 K; b. 3.62 um 
Exercise: 
Problem: 
The tungsten elements of incandescent light bulbs operate at 3200 K. At what wavelength does the 
filament radiate maximum energy? 
Exercise: 
Problem: 


Interstellar space is filled with radiation of wavelength 970,1m. This radiation is considered to be a 
remnant of the “big bang.” What is the corresponding blackbody temperature of this radiation? 


Solution: 


about 3 K 
Exercise: 


Problem: 


The radiant energy from the sun reaches its maximum at a wavelength of about 500.0 nm. What is the 
approximate temperature of the sun’s surface? 


Glossary 


absorber 
any object that absorbs radiation 


blackbody 
perfect absorber/emitter 


blackbody radiation 
radiation emitted by a blackbody 


emitter 
any object that emits radiation 


Planck’s hypothesis of energy quanta 
energy exchanges between the radiation and the walls take place only in the form of discrete energy 
quanta 


power intensity 
energy that passes through a unit surface per unit time 


quantized energies 
discrete energies; not continuous 


quantum state of a Planck’s oscillator 
any mode of vibration of Planck’s oscillator, enumerated by quantum number 


Stefan—Boltzmann constant 
physical constant in Stefan’s law 


Photoelectric Effect 
By the end of this section you will be able to: 


¢ Describe physical characteristics of the photoelectric effect 
e Explain why the photoelectric effect cannot be explained by classical physics 
e Describe how Einstein’s idea of a particle of radiation explains the photoelectric effect 


When a metal surface is exposed to a monochromatic electromagnetic wave of sufficiently 
short wavelength (or equivalently, above a threshold frequency), the incident radiation is 
absorbed and the exposed surface emits electrons. This phenomenon is known as the 
photoelectric effect. Electrons that are emitted in this process are called photoelectrons. 


The experimental setup to study the photoelectric effect is shown schematically in [link]. The 
target material serves as the anode, which becomes the emitter of photoelectrons when it is 
illuminated by monochromatic radiation. We call this electrode the photoelectrode. 
Photoelectrons are collected at the cathode, which is kept at a lower potential with respect to 
the anode. The potential difference between the electrodes can be increased or decreased, or 
its polarity can be reversed. The electrodes are enclosed in an evacuated glass tube so that 
photoelectrons do not lose their kinetic energy on collisions with air molecules in the space 
between electrodes. 


When the target material is not exposed to radiation, no current is registered in this circuit 
because the circuit is broken (note, there is a gap between the electrodes). But when the 
target material is connected to the negative terminal of a battery and exposed to radiation, a 
current is registered in this circuit; this current is called the photocurrent. Suppose that we 
now reverse the potential difference between the electrodes so that the target material now 
connects with the positive terminal of a battery, and then we slowly increase the voltage. The 
photocurrent gradually dies out and eventually stops flowing completely at some value of 
this reversed voltage. The potential difference at which the photocurrent stops flowing is 
called the stopping potential. 


Incident 


Anode light 


(Target material) Cathode 


Electron flow Ammeter 


Voltmeter 


Rheostat 


An experimental setup to study the photoelectric effect. The 
anode and cathode are enclosed in an evacuated glass tube. The 
voltmeter measures the electric potential difference between the 

electrodes, and the ammeter measures the photocurrent. The 

incident radiation is monochromatic. 


Characteristics of the Photoelectric Effect 


The photoelectric effect has three important characteristics that cannot be explained by 
classical physics: (1) the absence of a lag time, (2) the independence of the kinetic energy of 
photoelectrons on the intensity of incident radiation, and (3) the presence of a cut-off 
frequency. Let’s examine each of these characteristics. 


The absence of lag time 


When radiation strikes the target material in the electrode, electrons are emitted almost 
instantaneously, even at very low intensities of incident radiation. This absence of lag time 
contradicts our understanding based on classical physics. Classical physics predicts that for 
low-energy radiation, it would take significant time before irradiated electrons could gain 
sufficient energy to leave the electrode surface; however, such an energy buildup is not 
observed. 


The intensity of incident radiation and the kinetic energy of photoelectrons 


Typical experimental curves are shown in [link], in which the photocurrent is plotted versus 
the applied potential difference between the electrodes. For the positive potential difference, 
the current steadily grows until it reaches a plateau. Furthering the potential increase beyond 
this point does not increase the photocurrent at all. A higher intensity of radiation produces a 
higher value of photocurrent. For the negative potential difference, as the absolute value of 
the potential difference increases, the value of the photocurrent decreases and becomes zero 
at the stopping potential. For any intensity of incident radiation, whether the intensity is high 
or low, the value of the stopping potential always stays at one value. 


To understand why this result is unusual from the point of view of classical physics, we first 
have to analyze the energy of photoelectrons. A photoelectron that leaves the surface has 
kinetic energy K. It gained this energy from the incident electromagnetic wave. In the space 
between the electrodes, a photoelectron moves in the electric potential and its energy changes 
by the amount gAV, where AV is the potential difference and g = —e. Because no forces 
are present but electric force, by applying the work-energy theorem, we obtain the energy 
balance AK — eAV = 0 for the photoelectron, where AK is the change in the 
photoelectron’s kinetic energy. When the stopping potential — AV, is applied, the 
photoelectron loses its initial kinetic energy A; and comes to rest. Thus, its energy balance 
becomes (0 — K;) — e(—AV,) = 0, so that K; = eAY,. In the presence of the stopping 
potential, the largest kinetic energy AKyax that a photoelectron can have is its initial kinetic 
energy, which it has at the surface of the photoelectrode. Therefore, the largest kinetic energy 
of photoelectrons can be directly measured by measuring the stopping potential: 


Note: 
Equation: 


Kaa = eAVs. 


At this point we can see where the classical theory is at odds with the experimental results. In 
classical theory, the photoelectron absorbs electromagnetic energy in a continuous way; this 
means that when the incident radiation has a high intensity, the kinetic energy in [link] is 
expected to be high. Similarly, when the radiation has a low intensity, the kinetic energy is 
expected to be low. But the experiment shows that the maximum kinetic energy of 
photoelectrons is independent of the light intensity. 


Photocurrent 


High intensity 


Low intensity 


Potential difference 


The detected photocurrent plotted 
versus the applied potential 
difference shows that for any 
intensity of incident radiation, 
whether the intensity is high (upper 
curve) or low (lower curve), the 
value of the stopping potential is 
always the same. 


The presence of a cut-off frequency 


For any metal surface, there is a minimum frequency of incident radiation below which 
photocurrent does not occur. The value of this cut-off frequency for the photoelectric effect 
is a physical property of the metal: Different materials have different values of cut-off 
frequency. Experimental data show a typical linear trend (see [link]). The kinetic energy of 
photoelectrons at the surface grows linearly with the increasing frequency of incident 
radiation. Measurements for all metal surfaces give linear plots with one slope. None of these 
observed phenomena is in accord with the classical understanding of nature. According to the 
classical description, the kinetic energy of photoelectrons should not depend on the 
frequency of incident radiation at all, and there should be no cut-off frequency. Instead, in the 
classical picture, electrons receive energy from the incident electromagnetic wave in a 


continuous way, and the amount of energy they receive depends only on the intensity of the 
incident light and nothing else. So in the classical understanding, as long as the light is 
shining, the photoelectric effect is expected to continue. 


Frequency f 


Kinetic energy of photoelectrons at the surface 
versus the frequency of incident radiation. The 
photoelectric effect can only occur above the cut- 
off frequency f.. Measurements for all metal 
surfaces give linear plots with one slope. Each 
metal surface has its own cut-off frequency. 


The Work Function 


The photoelectric effect was explained in 1905 by A. Einstein. Einstein reasoned that if 
Planck’s hypothesis about energy quanta was correct for describing the energy exchange 
between electromagnetic radiation and cavity walls, it should also work to describe energy 
absorption from electromagnetic radiation by the surface of a photoelectrode. He postulated 
that an electromagnetic wave carries its energy in discrete packets. Einstein’s postulate goes 
beyond Planck’s hypothesis because it states that the light itself consists of energy quanta. In 
other words, it states that electromagnetic waves are quantized. 


In Einstein’s approach, a beam of monochromatic light of frequency f is made of photons. A 
photon is a particle of light. Each photon moves at the speed of light and carries an energy 
quantum £;. A photon’s energy depends only on its frequency f. Explicitly, the energy of a 
photon is 


Note: 
Equation: 


where A is Planck’s constant. In the photoelectric effect, photons arrive at the metal surface 
and each photon gives away all of its energy to only one electron on the metal surface. This 
transfer of energy from photon to electron is of the “all or nothing” type, and there are no 
fractional transfers in which a photon would lose only part of its energy and survive. The 
essence of a quantum phenomenon is either a photon transfers its entire energy and ceases 
to exist or there is no transfer at all. This is in contrast with the classical picture, where 
fractional energy transfers are permitted. Having this quantum understanding, the energy 
balance for an electron on the surface that receives the energy / from a photon is 


Equation: 


where Kynax is the kinetic energy, given by [link], that an electron has at the very instant it 
gets detached from the surface. In this energy balance equation, ¢ is the energy needed to 
detach a photoelectron from the surface. This energy ¢ is called the work function of the 
metal. Each metal has its characteristic work function, as illustrated in [link]. To obtain the 
kinetic energy of photoelectrons at the surface, we simply invert the energy balance equation 
and use [link] to express the energy of the absorbed photon. This gives us the expression for 
the kinetic energy of photoelectrons, which explicitly depends on the frequency of incident 
radiation: 


Note: 
Equation: 


Kyax = hf — ¢. 


This equation has a simple mathematical form but its physics is profound. We can now 
elaborate on the physical meaning behind [link]. 


Typical Values of the Work Function for Some Common Metals 


Metal (eV) 
Na 2.46 
Al 4.08 
Pb 4.14 
Zn 4.31 
Fe 4.50 
Cu 4.70 
Ag 4.73 
Pt 6.35 


In Einstein’s interpretation, interactions take place between individual electrons and 
individual photons. The absence of a lag time means that these one-on-one interactions occur 
instantaneously. This interaction time cannot be increased by lowering the light intensity. The 
light intensity corresponds to the number of photons arriving at the metal surface per unit 
time. Even at very low light intensities, the photoelectric effect still occurs because the 
interaction is between one electron and one photon. As long as there is at least one photon 
with enough energy to transfer it to a bound electron, a photoelectron will appear on the 
surface of the photoelectrode. 


The existence of the cut-off frequency f, for the photoelectric effect follows from [link] 
because the kinetic energy Kyax of the photoelectron can take only positive values. This 
means that there must be some threshold frequency for which the kinetic energy is zero, 
0 = hf. — @. In this way, we obtain the explicit formula for cut-off frequency: 


Note: 
Equation: 


Cut-off frequency depends only on the work function of the metal and is in direct proportion 
to it. When the work function is large (when electrons are bound fast to the metal surface), 
the energy of the threshold photon must be large to produce a photoelectron, and then the 
corresponding threshold frequency is large. Photons with frequencies larger than the 
threshold frequency f, always produce photoelectrons because they have Kyax > 0. Photons 
with frequencies smaller than f, do not have enough energy to produce photoelectrons. 
Therefore, when incident radiation has a frequency below the cut-off frequency, the 
photoelectric effect is not observed. Because frequency f and wavelength A of 
electromagnetic waves are related by the fundamental relation Af = c (where c is the speed 
of light in vacuum), the cut-off frequency has its corresponding cut-off wavelength 4, : 
Equation: 


é c he 
Ag= —_— = — . 
f. o/h oo 


In this equation, hc = 1240 eV - nm. Our observations can be restated in the following 
equivalent way: When the incident radiation has wavelengths longer than the cut-off 
wavelength, the photoelectric effect does not occur. 


Example: 

Photoelectric Effect for Silver 

Radiation with wavelength 300 nm is incident on a silver surface. Will photoelectrons be 
observed? 

Strategy 

Photoelectrons can be ejected from the metal surface only when the incident radiation has a 
shorter wavelength than the cut-off wavelength. The work function of silver is 6 = 4.73 eV 
({link]). To make the estimate, we use [link]. 

Solution 

The threshold wavelength for observing the photoelectric effect in silver is 

Equation: 


he = 1240 eV - nm 
@  4.73eV 


Ae = = 262 nm. 


The incident radiation has wavelength 300 nm, which is longer than the cut-off wavelength; 
therefore, photoelectrons are not observed. 

Significance 

If the photoelectrode were made of sodium instead of silver, the cut-off wavelength would 
be 504 nm and photoelectrons would be observed. 


[link] in Einstein’s model tells us that the maximum kinetic energy of photoelectrons is a 
linear function of the frequency of incident radiation, which is illustrated in [link]. For any 
metal, the slope of this plot has a value of Planck’s constant. The intercept with the Kyax- 
axis gives us a value of the work function that is characteristic for the metal. On the other 
hand, Kynax can be directly measured in the experiment by measuring the value of the 
stopping potential AV, (see [link]) at which the photocurrent stops. These direct 
measurements allow us to determine experimentally the value of Planck’s constant, as well as 
work functions of materials. 


Einstein’s model also gives a straightforward explanation for the photocurrent values shown 
in [link]. For example, doubling the intensity of radiation translates to doubling the number 
of photons that strike the surface per unit time. The larger the number of photons, the larger 
is the number of photoelectrons, which leads to a larger photocurrent in the circuit. This is 
how radiation intensity affects the photocurrent. The photocurrent must reach a plateau at 
some value of potential difference because, in unit time, the number of photoelectrons is 
equal to the number of incident photons and the number of incident photons does not depend 
on the applied potential difference at all, but only on the intensity of incident radiation. The 
stopping potential does not change with the radiation intensity because the kinetic energy of 
photoelectrons (see [link]) does not depend on the radiation intensity. 


Example: 

Work Function and Cut-Off Frequency 

When a 180-nm light is used in an experiment with an unknown metal, the measured 
photocurrent drops to zero at potential — 0.80 V. Determine the work function of the metal 
and its cut-off frequency for the photoelectric effect. 

Strategy 

To find the cut-off frequency f., we use [link], but first we must find the work function @. 
To find ¢, we use [link] and [link]. Photocurrent drops to zero at the stopping value of 
potential, so we identify AV, = 0.8V. 

Solution 

We use [link] to find the kinetic energy of the photoelectrons: 

Equation: 


Kimax = eAV; = e (0.80V) = 0.80 eV. 


Now we solve [link] for ¢ : 
Equation: 


1240 eV - 
FT IL is iy ney 
A 180 nm 


Finally, we use [link] to find the cut-off frequency: 
Equation: 


o_ 6.09 eV 


= 1.47 x 10°! Hz. 
h 4,136 x 10°%eV-s 


f= 


Significance 
In calculations like the one shown in this example, it is convenient to use Planck’s constant 
in the units of eV - s and express all energies in eV instead of joules. 


Example: 

The Photon Energy and Kinetic Energy of Photoelectrons 

A 430-nm violet light is incident on a calcium photoelectrode with a work function of 2.71 
eV. 

Find the energy of the incident photons and the maximum kinetic energy of ejected 
electrons. 

Strategy 

The energy of the incident photon is Ey = hf = hc/ A, where we use fA = c. To obtain 
the maximum energy of the ejected electrons, we use [link]. 

Solution 

Equation: 


he _ 1240 eV -nm 
A 430 nm 


E;= = 2.88 eV, Kmax = Ey — 6 = 2.88 eV — 2.71eV =0.17eV 


Significance 
In this experimental setup, photoelectrons stop flowing at the stopping potential of 0.17 V. 


Note: 
Exercise: 


Problem: 

Check Your Understanding A yellow 589-nm light is incident on a surface whose 
work function is 1.20 eV. What is the stopping potential? What is the cut-off 
wavelength? 


Solution: 


—0.91 V; 1040 nm 


Note: 
Exercise: 


Problem: 


Check Your Understanding Cut-off frequency for the photoelectric effect in some 
materials is 8.0 x 10!°Hz. When the incident light has a frequency of 1.2 x 10'*Hz, 
the stopping potential is measured as — 0.16 V. Estimate a value of Planck’s constant 
from these data (in units J - s and eV - s) and determine the percentage error of your 
estimation. 


Solution: 


h =6.40 x 1073 -s=4.0 x 10°eV-s; —3.5% 


Summary 


e The photoelectric effect occurs when photoelectrons are ejected from a metal surface in 
response to monochromatic radiation incident on the surface. It has three characteristics: 
(1) it is instantaneous, (2) it occurs only when the radiation is above a cut-off frequency, 
and (3) kinetic energies of photoelectrons at the surface do not depend of the intensity 
of radiation. The photoelectric effect cannot be explained by classical theory. 

e We can explain the photoelectric effect by assuming that radiation consists of photons 
(particles of light). Each photon carries a quantum of energy. The energy of a photon 
depends only on its frequency, which is the frequency of the radiation. At the surface, 
the entire energy of a photon is transferred to one photoelectron. 

e The maximum kinetic energy of a photoelectron at the metal surface is the difference 
between the energy of the incident photon and the work function of the metal. The work 
function is the binding energy of electrons to the metal surface. Each metal has its own 
characteristic work function. 


Conceptual Questions 


Exercise: 


Problem: 


For the same monochromatic light source, would the photoelectric effect occur for all 
metals? 


Solution: 


No 


Exercise: 


Problem: 
In the interpretation of the photoelectric effect, how is it known that an electron does not 
absorb more than one photon? 
Exercise: 
Problem: 
Explain how you can determine the work function from a plot of the stopping potential 


versus the frequency of the incident radiation in a photoelectric effect experiment. Can 
you determine the value of Planck’s constant from this plot? 


Solution: 


from the slope 
Exercise: 
Problem: 
Suppose that in the photoelectric-effect experiment we make a plot of the detected 
current versus the applied potential difference. What information do we obtain from 


such a plot? Can we determine from it the value of Planck’s constant? Can we 
determine the work function of the metal? 


Exercise: 
Problem: 


Speculate how increasing the temperature of a photoelectrode affects the outcomes of 
the photoelectric effect experiment. 


Solution: 


Answers may vary 
Exercise: 


Problem: 


Which aspects of the photoelectric effect cannot be explained by classical physics? 
Exercise: 


Problem: 


Is the photoelectric effect a consequence of the wave character of radiation or is it a 
consequence of the particle character of radiation? Explain briefly. 


Solution: 


the particle character 
Exercise: 
Problem: 
The metals sodium, iron, and molybdenum have work functions 2.5 eV, 3.9 eV, and 4.2 


eV, respectively. Which of these metals will emit photoelectrons when illuminated with 
400 nm light? 


Problems 


Exercise: 


Problem: A photon has energy 20 keV. What are its frequency and wavelength? 
Solution: 


4.835 x 10'8 Hz; 0.620 A 
Exercise: 
Problem: 
The wavelengths of visible light range from approximately 400 to 750 nm. What is the 
corresponding range of photon energies for visible light? 
Exercise: 
Problem: 


What is the longest wavelength of radiation that can eject a photoelectron from silver? 
Is it in the visible range? 


Solution: 


263 nm; no 
Exercise: 
Problem: 
What is the longest wavelength of radiation that can eject a photoelectron from 
potassium, given the work function of potassium 2.24 eV? Is it in the visible range? 


Exercise: 


Problem: 


Estimate the binding energy of electrons in magnesium, given that the wavelength of 
337 nm is the longest wavelength that a photon may have to eject a photoelectron from 
magnesium photoelectrode. 


Solution: 


3.68 eV 
Exercise: 
Problem: 
The work function for potassium is 2.26 eV. What is the cutoff frequency when this 


metal is used as photoelectrode? What is the stopping potential when for the emitted 
electrons when this photoelectrode is exposed to radiation of frequency 1200 THz? 


Exercise: 
Problem: 
Estimate the work function of aluminum, given that the wavelength of 304 nm is the 


longest wavelength that a photon may have to eject a photoelectron from aluminum 
photoelectrode. 


Solution: 


4.09 eV 
Exercise: 


Problem: 


What is the maximum kinetic energy of photoelectrons ejected from sodium by the 
incident radiation of wavelength 450 nm? 


Exercise: 


Problem: 


A 120-nm UV radiation illuminates a gold-plated electrode. What is the maximum 
kinetic energy of the ejected photoelectrons? 


Solution: 


5.04 eV 


Exercise: 


Problem: 
A 400-nm violet light ejects photoelectrons with a maximum kinetic energy of 0.860 eV 
from sodium photoelectrode. What is the work function of sodium? 

Exercise: 
Problem: 
A 600-nm light falls on a photoelectric surface and electrons with the maximum kinetic 
energy of 0.17 eV are emitted. Determine (a) the work function and (b) the cutoff 


frequency of the surface. (c) What is the stopping potential when the surface is 
illuminated with light of wavelength 400 nm? 


Solution: 


a. 1.89 eV; b. 459 THz; c. 1.21 V 
Exercise: 
Problem: 
The cutoff wavelength for the emission of photoelectrons from a particular surface is 


500 nm. Find the maximum kinetic energy of the ejected photoelectrons when the 
surface is illuminated with light of wavelength 600 nm. 


Exercise: 
Problem: 


Find the wavelength of radiation that can eject 2.00-eV electrons from calcium 
electrode. The work function for calcium is 2.71 eV. In what range is this radiation? 


Solution: 


264 nm; UV 
Exercise: 
Problem: 
Find the wavelength of radiation that can eject 0.10-eV electrons from potassium 
electrode. The work function for potassium is 2.24 eV. In what range is this radiation? 
Exercise: 
Problem: 


Find the maximum velocity of photoelectrons ejected by an 80-nm radiation, if the work 
function of photoelectrode is 4.73 eV. 


Solution: 


1.95 x 10°m/s 


Glossary 


cut-off frequency 
frequency of incident light below which the photoelectric effect does not occur 


cut-off wavelength 
wavelength of incident light that corresponds to cut-off frequency 


energy of a photon 
quantum of radiant energy, depends only on a photon’s frequency 


photocurrent 
in a circuit, current that flows when a photoelectrode is illuminated 


photoelectric effect 
emission of electrons from a metal surface exposed to electromagnetic radiation of the 
proper frequency 


photoelectrode 
in a circuit, an electrode that emits photoelectrons 


photoelectron 
electron emitted from a metal surface in the presence of incident radiation 


photon 
particle of light 


quantum phenomenon 
in interaction with matter, photon transfers either all its energy or nothing 


stopping potential 
in a circuit, potential difference that stops photocurrent 


work function 
energy needed to detach photoelectron from the metal surface 


The Compton Effect 
By the end of this section, you will be able to: 


e Describe Compton’s experiment 

e Explain the Compton wavelength shift 

e Describe how experiments with X-rays confirm the particle nature of 
radiation 


Two of Einstein’s influential ideas introduced in 1905 were the theory of 
special relativity and the concept of a light quantum, which we now call a 
photon. Beyond 1905, Einstein went further to suggest that freely 
propagating electromagnetic waves consisted of photons that are particles 
of light in the same sense that electrons or other massive particles are 
particles of matter. A beam of monochromatic light of wavelength A (or 
equivalently, of frequency f) can be seen either as a classical wave or aS a 
collection of photons that travel in a vacuum with one speed, c (the speed of 
light), and all carrying the same energy, E' = hf. This idea proved useful 
for explaining the interactions of light with particles of matter. 


Momentum of a Photon 


Unlike a particle of matter that is characterized by its rest mass mo, a 
photon is massless. In a vacuum, unlike a particle of matter that may vary 
its speed but cannot reach the speed of light, a photon travels at only one 
speed, which is exactly the speed of light. From the point of view of 
Newtonian classical mechanics, these two characteristics imply that a 
photon should not exist at all. For example, how can we find the linear 
momentum or kinetic energy of a body whose mass is zero? This apparent 
paradox vanishes if we describe a photon as a relativistic particle. 
According to the theory of special relativity, any particle in nature obeys the 
relativistic energy equation 


Note: 
Equation: 


ee ee ee 


This relation can also be applied to a photon. In [link], EF is the total energy 
of a particle, p is its linear momentum, and 7p is its rest mass. For a 
photon, we simply set ™p = 0 in this equation. This leads to the expression 
for the momentum pf of a photon 


Note: 
Equation: 


ar 


Here the photon’s energy Fy is the same as that of a light quantum of 
frequency f, which we introduced to explain the photoelectric effect: 


Note: 
Equation: 


The wave relation that connects frequency f with wavelength A and speed c 
also holds for photons: 
Equation: 


Af =cx 


Therefore, a photon can be equivalently characterized by either its energy 
and wavelength, or its frequency and momentum. [link] and [link] can be 
combined into the explicit relation between a photon’s momentum and its 
wavelength: 


Note: 
Equation: 


S| Se 


es a 


Notice that this equation gives us only the magnitude of the photon’s 
momentum and contains no information about the direction in which the 
photon is moving. To include the direction, it is customary to write the 
photon’s momentum as a vector: 


Note: 
Equation: 


In [link], & = h / 27 is the reduced Planck’s constant (pronounced “h- 


— 
bar”), which is just Planck’s constant divided by the factor 277. Vector k is 
called the “wave vector” or propagation vector (the direction in which a 
photon is moving). The propagation vector shows the direction of the 


photon’s linear momentum vector. The magnitude of the wave vector is 


a; 


does not introduce any new physics. We can verify that the magnitude of 
the vector in [link] is the same as that given by [link]. 


= 2 /X and is called the wave number. Notice that this equation 


The Compton Effect 


The Compton effect is the term used for an unusual result observed when 
X-rays are scattered on some materials. By classical theory, when an 
electromagnetic wave is scattered off atoms, the wavelength of the scattered 
radiation is expected to be the same as the wavelength of the incident 
radiation. Contrary to this prediction of classical physics, observations show 
that when X-rays are scattered off some materials, such as graphite, the 
scattered X-rays have different wavelengths from the wavelength of the 
incident X-rays. This classically unexplainable phenomenon was studied 
experimentally by Arthur H. Compton and his collaborators, and Compton 
gave its explanation in 1923. 


To explain the shift in wavelengths measured in the experiment, Compton 
used Ejinstein’s idea of light as a particle. The Compton effect has a very 
important place in the history of physics because it shows that 
electromagnetic radiation cannot be explained as a purely wave 
phenomenon. The explanation of the Compton effect gave a convincing 
argument to the physics community that electromagnetic waves can indeed 
behave like a stream of photons, which placed the concept of a photon on 
firm ground. 


The schematics of Compton’s experimental setup are shown in [link]. The 
idea of the experiment is straightforward: Monochromatic X-rays with 
wavelength J are incident on a sample of graphite (the “target”), where they 
interact with atoms inside the sample; they later emerge as scattered X-rays 
with wavelength A/. A detector placed behind the target can measure the 
intensity of radiation scattered in any direction @ with respect to the 
direction of the incident X-ray beam. This scattering angle, 6, is the angle 
between the direction of the scattered beam and the direction of the incident 
beam. In this experiment, we know the intensity and the wavelength A of 


the incoming (incident) beam; and for a given scattering angle 0, we 
measure the intensity and the wavelength X/ of the outgoing (scattered) 
beam. Typical results of these measurements are shown in [link], where the 
x-axis is the wavelength of the scattered X-rays and the y-axis is the 
intensity of the scattered X-rays, measured for different scattering angles 
(indicated on the graphs). For all scattering angles (except for 0 = 0°), we 
measure two intensity peaks. One peak is located at the wavelength 4, 
which is the wavelength of the incident beam. The other peak is located at 
some other wavelength, A/. The two peaks are separated by AA, which 
depends on the scattering angle 0 of the outgoing beam (in the direction of 
observation). The separation A is called the Compton shift. 
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Experimental setup for studying Compton scattering. 
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Experimental data show the Compton effect for X-rays scattering off 
graphite at various angles: The intensity of the scattered beam has two 
peaks. One peak appears at the wavelength A of the incident radiation 
and the second peak appears at wavelength A’. The separation AA 
between the peaks depends on the scattering angle 0, which is the 
angular position of the detector in [link]. The experimental data in this 
figure are plotted in arbitrary units so that the height of the profile 
reflects the intensity of the scattered beam above background noise. 


Compton Shift 


As given by Compton, the explanation of the Compton shift is that in the 
target material, graphite, valence electrons are loosely bound in the atoms 
and behave like free electrons. Compton assumed that the incident X-ray 
radiation is a stream of photons. An incoming photon in this stream collides 
with a valence electron in the graphite target. In the course of this collision, 
the incoming photon transfers some part of its energy and momentum to the 
target electron and leaves the scene as a scattered photon. This model 
explains in qualitative terms why the scattered radiation has a longer 
wavelength than the incident radiation. Put simply, a photon that has lost 
some of its energy emerges as a photon with a lower frequency, or 
equivalently, with a longer wavelength. To show that his model was correct, 
Compton used it to derive the expression for the Compton shift. In his 
derivation, he assumed that both photon and electron are relativistic 
particles and that the collision obeys two commonsense principles: (1) the 


conservation of linear momentum and (2) the conservation of total 
relativistic energy. 


In the following derivation of the Compton shift, & - and Py denote the 
energy and momentum, respectively, of an incident photon with frequency 
f. The photon collides with a relativistic electron at rest, which means that 
immediately before the collision, the electron’s energy is entirely its rest 
mass energy, moc”. Immediately after the collision, the electron has energy 


E and momentum Dp, both of which satisfy [link]. Immediately after the 


collision, the outgoing photon has energy E f, Momentum ae and 
frequency f’. The direction of the incident photon is horizontal from left to 
right, and the direction of the outgoing photon is at the angle 0, as 
illustrated in [link]. The scattering angle @ is the angle between the 


— = 
momentum vectors py and pf, and we can write their scalar product: 
Equation: 


= z 
Py: Pf = pepfcosd. 


Following Compton’s argument, we assume that the colliding photon and 
electron form an isolated system. This assumption is valid for weakly 
bound electrons that, to a good approximation, can be treated as free 
particles. Our first equation is the conservation of energy for the photon- 
electron system: 

Equation: 


Ey+ moc? = E;+E. 


The left side of this equation is the energy of the system at the instant 
immediately before the collision, and the right side of the equation is the 
energy of the system at the instant immediately after the collision. Our 
second equation is the conservation of linear momentum for the photon— 
electron system where the electron is at rest at the instant immediately 
before the collision: 


Equation: 


> Fo 
Pf =Pyf +P. 


The left side of this equation is the momentum of the system right before 
the collision, and the right side of the equation is the momentum of the 
system right after collision. The entire physics of Compton scattering is 
contained in these three preceding equations—the remaining part is algebra. 
At this point, we could jump to the concluding formula for the Compton 
shift, but it is beneficial to highlight the main algebraic steps that lead to 
Compton’s formula, which we give here as follows. 


We start with rearranging the terms in [link] and squaring it: 
Equation: 


~ 2 
(Ey =— B;) + moc? | = fi’. 


In the next step, we substitute [link] for E77, simplify, and divide both sides 
by c? to obtain 
Equation: 


(B;/e—B;/e) + 2moe (Bye ~ B;/c) = p’. 


Now we can use [link] to express this form of the energy equation in terms 
of momenta. The result is 
Equation: 


2.19 ~ 
(py — By)” + 2moc (py — By) = dD”. 
To eliminate p, we turn to the momentum equation [link], rearrange its 


terms, and square it to obtain 
Equation: 


2 


2 
> Fg > 2 ~ > 2 
(B; -3;) =p’ and (8; -3;) = p++ Dd; — py: py. 


The product of the momentum vectors is given by [link]. When we 
substitute this result for p? in [link], we obtain the energy equation that 
contains the scattering angle 0 : 

Equation: 


mi 2 as ~ ae 
(pt — Bf)” + 2moc (ps — De) = py + Dy — 2 FP pcos. 


With further algebra, this result can be simplified to 
Equation: 


Now recall [link] and write: 1/py = A’/h and 1/py = A/h. When these 
relations are substituted into [link], we obtain the relation for the Compton 
shift: 

Equation: 


M-X = 


mie (1 — cos6). 


The factor h / moc is called the Compton wavelength of the electron: 


Note: 
Equation: 


h 
Ac = —— = 0.00243 nm = 2.43 pm. 
Moc 


Denoting the shift as AA = A/—A, the concluding result can be rewritten as 


Note: 
Equation: 


AA = X-(1 — cos6). 


This formula for the Compton shift describes outstandingly well the 
experimental results shown in [link]. Scattering data measured for 
molybdenum, graphite, calcite, and many other target materials are in 
accord with this theoretical result. The nonshifted peak shown in [link] is 
due to photon collisions with tightly bound inner electrons in the target 
material. Photons that collide with the inner electrons of the target atoms in 
fact collide with the entire atom. In this extreme case, the rest mass in [link] 
must be changed to the rest mass of the atom. This type of shift is four 
orders of magnitude smaller than the shift caused by collisions with 
electrons and is so small that it can be neglected. 


Compton scattering is an example of inelastic scattering, in which the 
scattered radiation has a longer wavelength than the wavelength of the 
incident radiation. In today’s usage, the term “Compton scattering” is used 
for the inelastic scattering of photons by free, charged particles. In Compton 
scattering, treating photons as particles with momenta that can be 
transferred to charged particles provides the theoretical background to 
explain the wavelength shifts measured in experiments; this is the evidence 
that radiation consists of photons. 


Example: 

Compton Scattering 

An incident 71-pm X-ray is incident on a calcite target. Find the 
wavelength of the X-ray scattered at a 30° angle. What is the largest shift 


that can be expected in this experiment? 

Strategy 

To find the wavelength of the scattered X-ray, first we must find the 
Compton shift for the given scattering angle, 9 = 30°. We use [link]. Then 
we add this shift to the incident wavelength to obtain the scattered 
wavelength. The largest Compton shift occurs at the angle 8 when 

1 — cos@ has the largest value, which is for the angle 9 = 180°. 

Solution 

The shift at 9 = 30° is 

Equation: 


AX = 4,(1 — cos30°) = 0.134A, = (0.134)(2.43) pm = 0.325 pm. 


This gives the scattered wavelength: 


Equation: 
Ml= + AX = (71 + 0.325) pm = 71.325 pm. 
The largest shift is 
Equation: 
(AX) max = Ac(1 — cos180°) = 2(2.43 pm) = 4.86 pm. 
Significance 


The largest shift in wavelength is detected for the backscattered radiation; 
however, most of the photons from the incident beam pass through the 
target and only a small fraction of photons gets backscattered (typically, 
less than 5%). Therefore, these measurements require highly sensitive 
detectors. 


Note: 
Exercise: 


Problem: 


Check Your Understanding An incident 71-pm X-ray is incident on 
a calcite target. Find the wavelength of the X-ray scattered at a 60° 
angle. What is the smallest shift that can be expected in this 
experiment? 


Solution: 


(AX), = 0m at a0° angle; 71.0 pm + 0.5A, = 72.215 pm 


min 


Summary 


e In the Compton effect, X-rays scattered off some materials have 
different wavelengths than the wavelength of the incident X-rays. This 
phenomenon does not have a classical explanation. 

¢ The Compton effect is explained by assuming that radiation consists of 
photons that collide with weakly bound electrons in the target material. 
Both electron and photon are treated as relativistic particles. 
Conservation laws of the total energy and of momentum are obeyed in 
collisions. 

e Treating the photon as a particle with momentum that can be 
transferred to an electron leads to a theoretical Compton shift that 
agrees with the wavelength shift measured in the experiment. This 
provides evidence that radiation consists of photons. 

e Compton scattering is an inelastic scattering, in which scattered 
radiation has a longer wavelength than that of incident radiation. 


Conceptual Questions 


Exercise: 


Problem: 


Discuss any similarities and differences between the photoelectric and 
the Compton effects. 


Solution: 


Answers may vary 
Exercise: 


Problem: 


Which has a greater momentum: an UV photon or an IR photon? 
Exercise: 

Problem: 

Does changing the intensity of a monochromatic light beam affect the 


momentum of the individual photons in the beam? Does such a change 
affect the net momentum of the beam? 


Solution: 


no; yes 
Exercise: 
Problem: 
Can the Compton effect occur with visible light? If so, will it be 
detectable? 
Exercise: 
Problem: 


Is it possible in the Compton experiment to observe scattered X-rays 
that have a shorter wavelength than the incident X-ray radiation? 


Solution: 


no 
Exercise: 


Problem: 


Show that the Compton wavelength has the dimension of length. 
Exercise: 


Problem: 


At what scattering angle is the wavelength shift in the Compton effect 
equal to the Compton wavelength? 


Solution: 


right angle 


Problems 
Exercise: 

Problem: What is the momentum of a 589-nm yellow photon? 
Exercise: 

Problem: What is the momentum of a 4-cm microwave photon? 


Solution: 


1.66 x 10° kg- m/s 
Exercise: 
Problem: 
In a beam of white light (wavelengths from 400 to 750 nm), what 
range of momentum can the photons have? 


Exercise: 


Problem: 


What is the energy of a photon whose momentum is 
3.0 x 10°-*4kg - m/s? 


Solution: 


56.21 eV 
Exercise: 


Problem: 


What is the wavelength of (a) a 12-keV X-ray photon; (b) a 2.0-MeV y 
-ray photon? 


Exercise: 


Problem: Find the momentum and energy of a 1.0-A photon. 
Solution: 


6.63 x 10°*% kg - m/s; 124 keV 
Exercise: 


Problem: 


Find the wavelength and energy of a photon with momentum 
5.00 x 10 kg - m/s. 


Exercise: 


Problem: 


A y-ray photon has a momentum of 8.00 x 10° *'kg - m/s. Find its 
wavelength and energy. 


Solution: 


82.9 fm; 15 MeV 


Exercise: 
Problem: 
(a) Calculate the momentum of a 2.5-ym photon. (b) Find the velocity 


of an electron with the same momentum. (c) What is the kinetic energy 
of the electron, and how does it compare to that of the photon? 


Exercise: 


Problem: 


Show that p = h/A and E+ = hf are consistent with the relativistic 


formula H? = p?c? + mc’. 


Solution: 


(Proof) 
Exercise: 

Problem: 

Show that the energy E in eV of a photon is given by 

E =1.241 x 10 °eV-m/4A, where 2 is its wavelength in meters. 
Exercise: 

Problem: 

For collisions with free electrons, compare the Compton shift of a 


photon scattered as an angle of 30° to that of a photon scattered at 
AD. %; 


Solution: 


AX39 / Ads = 45.74 % 


Exercise: 


Problem: 


X-rays of wavelength 12.5 pm are scattered from a block of carbon. 
What are the wavelengths of photons scattered at (a) 30°; (b) 90°; 
and, (c) 180° ? 


Glossary 


Compton effect 


the change in wavelength when an X-ray is scattered by its interaction 
with some materials 


Compton shift 


difference between the wavelengths of the incident X-ray and the 
scattered X-ray 


Compton wavelength 
physical constant with the value A, = 2.43 pm 


inelastic scattering 


scattering effect where kinetic energy is not conserved but the total 
energy is conserved 


propagation vector 


vector with magnitude 27 / X that has the direction of the photon’s 
linear momentum 


reduced Planck’s constant 


Planck’s constant divided by 27 


scattering angle 
angle between the direction of the scattered beam and the direction of 
the incident beam 


wave number 


magnitude of the propagation vector 


Bohr’s Model of the Hydrogen Atom 
By the end of this section, you will be able to: 


e Explain the difference between the absorption spectrum and the emission spectrum of radiation 
emitted by atoms 

e Describe the Rutherford gold foil experiment and the discovery of the atomic nucleus 

e Explain the atomic structure of hydrogen 

e Describe the postulates of the early quantum theory for the hydrogen atom 

e Summarize how Bohr’s quantum model of the hydrogen atom explains the radiation spectrum of 
atomic hydrogen 


Historically, Bohr’s model of the hydrogen atom is the very first model of atomic structure that 
correctly explained the radiation spectra of atomic hydrogen. The model has a special place in the 
history of physics because it introduced an early quantum theory, which brought about new 
developments in scientific thought and later culminated in the development of quantum mechanics. To 
understand the specifics of Bohr’s model, we must first review the nineteenth-century discoveries that 
prompted its formulation. 


When we use a prism to analyze white light coming from the sun, several dark lines in the solar 
spectrum are observed ([link]). Solar absorption lines are called Fraunhofer lines after Joseph von 
Fraunhofer, who accurately measured their wavelengths. During 1854-1861, Gustav Kirchhoff and 
Robert Bunsen discovered that for the various chemical elements, the line emission spectrum of an 
element exactly matches its line absorption spectrum. The difference between the absorption spectrum 
and the emission spectrum is explained in [link]. An absorption spectrum is observed when light passes 
through a gas. This spectrum appears as black lines that occur only at certain wavelengths on the 
background of the continuous spectrum of white light ([link]). The missing wavelengths tell us which 
wavelengths of the radiation are absorbed by the gas. The emission spectrum is observed when light is 
emitted by a gas. This spectrum is seen as colorful lines on the black background (see [link] and [link]). 
Positions of the emission lines tell us which wavelengths of the radiation are emitted by the gas. Each 
chemical element has its own characteristic emission spectrum. For each element, the positions of its 
emission lines are exactly the same as the positions of its absorption lines. This means that atoms of a 
specific element absorb radiation only at specific wavelengths and radiation that does not have these 
wavelengths is not absorbed by the element at all. This also means that the radiation emitted by atoms 


of each element has exactly the same wavelengths as the radiation they absorb. 
KH G F E D Cc B A 


4000 4500 5000 5500 6000 6500 7000 7500 


In the solar emission spectrum in the visible range from 380 nm to 710 nm, Fraunhofer lines are 
observed as vertical black lines at specific spectral positions in the continuous spectrum. Highly 
sensitive modern instruments observe thousands of such lines. 


Flask 
with 
| gas 


Screen 


Absorption 
lines 


(a) 


Emission 
lines 


(b) 


Observation of line spectra: (a) setup to observe absorption lines; (b) setup to observe 
emission lines. (a) White light passes through a cold gas that is contained in a glass 
flask. A prism is used to separate wavelengths of the passed light. In the spectrum of 
the passed light, some wavelengths are missing, which are seen as black absorption 
lines in the continuous spectrum on the viewing screen. (b) A gas is contained in a 
glass discharge tube that has electrodes at its ends. At a high potential difference 
between the electrodes, the gas glows and the light emitted from the gas passes 
through the prism that separates its wavelengths. In the spectrum of the emitted light, 
only specific wavelengths are present, which are seen as colorful emission lines on 
the screen. 


The emission spectrum of atomic hydrogen: The spectral positions of emission lines are 
characteristic for hydrogen atoms. (credit: “Merikanto”/Wikimedia Commons) 


The emission spectrum of atomic iron: The spectral positions of emission lines are characteristic 
for iron atoms. 


Emission spectra of the elements have complex structures; they become even more complex for 
elements with higher atomic numbers. The simplest spectrum, shown in [link], belongs to the hydrogen 


atom. Only four lines are visible to the human eye. As you read from right to left in [link], these lines 
are: red (656 nm), called the H-a line; aqua (486 nm), blue (434 nm), and violet (410 nm). The lines 
with wavelengths shorter than 400 nm appear in the ultraviolet part of the spectrum ([link], far left) and 
are invisible to the human eye. There are infinitely many invisible spectral lines in the series for 
hydrogen. 


An empirical formula to describe the positions (wavelengths) A of the hydrogen emission lines in this 
series was discovered in 1885 by Johann Balmer. It is known as the Balmer formula: 


Note: 
Equation: 


The constant Ry = 1.09737 x 10’m~ is called the Rydberg constant for hydrogen. In [link], the 
positive integer n takes on values n = 3,4, 5, 6 for the four visible lines in this series. The series of 
emission lines given by the Balmer formula is called the Balmer series for hydrogen. Other emission 
lines of hydrogen that were discovered in the twentieth century are described by the Rydberg formula, 
which summarizes all of the experimental data: 


Note: 
Equation: 


2 
a 


1) 1 
= Ry (3 — +) wheren; = np +1,n¢+2,n¢+3,... 
f 


When n+ = 1, the series of spectral lines is called the Lyman series. When n+ = 2, the series is called 
the Balmer series, and in this case, the Rydberg formula coincides with the Balmer formula. When 

ns = 3, the series is called the Paschen series. When n = 4, the series is called the Brackett series. 
When 7 = 5, the series is called the Pfund series. When n+ = 6, we have the Humphreys series. As 
you may guess, there are infinitely many such spectral bands in the spectrum of hydrogen because n ¢ 
can be any positive integer number. 


The Rydberg formula for hydrogen gives the exact positions of the spectral lines as they are observed in 
a laboratory; however, at the beginning of the twentieth century, nobody could explain why it worked 
so well. The Rydberg formula remained unexplained until the first successful model of the hydrogen 
atom was proposed in 1913. 


Example: 

Limits of the Balmer Series 

Calculate the longest and the shortest wavelengths in the Balmer series. 

Strategy 

We can use either the Balmer formula or the Rydberg formula. The longest wavelength is obtained 
when 1 /n, is largest, which is when n; = n ¢ +1 = 3, because nf = 2 for the Balmer series. The 
smallest wavelength is obtained when 1 /n,; is smallest, which is 1/n; — 0 when n; — oo. 
Solution 

The long-wave limit: 


Equation: 
1 1 1 Tyre 1 
— = Ry | — — — } = (1.09737 x 107)—| — - —] > rA= 656.3nm 
Ne ee m\4 9 
The short-wave limit: 
Equation: 
1 1 7 1 fil 
Significance 


Note that there are infinitely many spectral lines lying between these two limits. 


Note: 
Exercise: 


Problem: 


Check Your Understanding What are the limits of the Lyman series? Can you see these spectral 
lines? 


Solution: 


121.5 nm and 91.1 nm; no, these spectral bands are in the ultraviolet 


The key to unlocking the mystery of atomic spectra is in understanding atomic structure. Scientists have 
long known that matter is made of atoms. According to nineteenth-century science, atoms are the 
smallest indivisible quantities of matter. This scientific belief was shattered by a series of 
groundbreaking experiments that proved the existence of subatomic particles, such as electrons, 
protons, and neutrons. 


The electron was discovered and identified as the smallest quantity of electric charge by J.J. Thomson 
in 1897 in his cathode ray experiments, also known as B-ray experiments: A B-ray is a beam of 
electrons. In 1904, Thomson proposed the first model of atomic structure, known as the “plum 
pudding” model, in which an atom consisted of an unknown positively charged matter with negative 
electrons embedded in it like plums in a pudding. Around 1900, E. Rutherford, and independently, Paul 
Ulrich Villard, classified all radiation known at that time as a-rays, B-rays, and y-rays (a y-ray is a 


beam of highly energetic photons). In 1907, Rutherford and Thomas Royds used spectroscopy methods 
to show that positively charged particles of a-radiation (called a-particles) are in fact doubly ionized 
atoms of helium. In 1909, Rutherford, Ernest Marsden, and Hans Geiger used a-particles in their 
famous scattering experiment that disproved Thomson’s model (see Linear Momentum and Collisions). 


In the Rutherford gold foil experiment (also known as the Geiger—Marsden experiment), a-particles 
were incident on a thin gold foil and were scattered by gold atoms inside the foil (see Types of 
Collisions). The outgoing particles were detected by a 360° scintillation screen surrounding the gold 
target (for a detailed description of the experimental setup, see Linear Momentum and Collisions). 
When a scattered particle struck the screen, a tiny flash of light (scintillation) was observed at that 
location. By counting the scintillations seen at various angles with respect to the direction of the 
incident beam, the scientists could determine what fraction of the incident particles were scattered and 
what fraction were not deflected at all. If the plum pudding model were correct, there would be no 
back-scattered a-particles. However, the results of the Rutherford experiment showed that, although a 
sizable fraction of a-particles emerged from the foil not scattered at all as though the foil were not in 
their way, a significant fraction of a-particles were back-scattered toward the source. This kind of result 
was possible only when most of the mass and the entire positive charge of the gold atom were 
concentrated in a tiny space inside the atom. 


In 1911, Rutherford proposed a nuclear model of the atom. In Rutherford’s model, an atom contained 
a positively charged nucleus of negligible size, almost like a point, but included almost the entire mass 
of the atom. The atom also contained negative electrons that were located within the atom but relatively 
far away from the nucleus. Ten years later, Rutherford coined the name proton for the nucleus of 
hydrogen and the name neutron for a hypothetical electrically neutral particle that would mediate the 
binding of positive protons in the nucleus (the neutron was discovered in 1932 by James Chadwick). 
Rutherford is credited with the discovery of the atomic nucleus; however, the Rutherford model of 
atomic structure does not explain the Rydberg formula for the hydrogen emission lines. 


Bohr’s model of the hydrogen atom, proposed by Niels Bohr in 1913, was the first quantum model 
that correctly explained the hydrogen emission spectrum. Bohr’s model combines the classical 
mechanics of planetary motion with the quantum concept of photons. Once Rutherford had established 
the existence of the atomic nucleus, Bohr’s intuition that the negative electron in the hydrogen atom 
must revolve around the positive nucleus became a logical consequence of the inverse-square-distance 
law of electrostatic attraction. Recall that Coulomb’s law describing the attraction between two opposite 
charges has a similar form to Newton’s universal law of gravitation in the sense that the gravitational 
force and the electrostatic force are both decreasing as 1 /r?, where r is the separation distance between 
the bodies. In the same way as Earth revolves around the sun, the negative electron in the hydrogen 
atom can revolve around the positive nucleus. However, an accelerating charge radiates its energy. 
Classically, if the electron moved around the nucleus in a planetary fashion, it would be undergoing 
centripetal acceleration, and thus would be radiating energy that would cause it to spiral down into the 
nucleus. Such a planetary hydrogen atom would not be stable, which is contrary to what we know about 
ordinary hydrogen atoms that do not disintegrate. Moreover, the classical motion of the electron is not 
able to explain the discrete emission spectrum of hydrogen. 


To circumvent these two difficulties, Bohr proposed the following three postulates of Bohr’s model: 


1. The negative electron moves around the positive nucleus (proton) in a circular orbit. All electron 
orbits are centered at the nucleus. Not all classically possible orbits are available to an electron 
bound to the nucleus. 

2. The allowed electron orbits satisfy the first quantization condition: In the nth orbit, the angular 
momentum L,, of the electron can take only discrete values: 


Note: 
Equation: 


Ji. = Diy WANTOD = I, BB o a 


This postulate says that the electron’s angular momentum is quantized. Denoted by r,, and vp, 
respectively, the radius of the nth orbit and the electron’s speed in it, the first quantization 
condition can be expressed explicitly as 

Equation: 


MeVnln = Nh. 


3. An electron is allowed to make transitions from one orbit where its energy is E,, to another orbit 
where its energy is 7. When an atom absorbs a photon, the electron makes a transition to a 
higher-energy orbit. When an atom emits a photon, the electron transits to a lower-energy orbit. 
Electron transitions with the simultaneous photon absorption or photon emission take place 
instantaneously. The allowed electron transitions satisfy the second quantization condition: 


Note: 
Equation: 


hf = |En as En 


where hf is the energy of either an emitted or an absorbed photon with frequency f. The second 
quantization condition states that an electron’s change in energy in the hydrogen atom is quantized. 


These three postulates of the early quantum theory of the hydrogen atom allow us to derive not only the 
Rydberg formula, but also the value of the Rydberg constant and other important properties of the 
hydrogen atom such as its energy levels, its ionization energy, and the sizes of electron orbits. Note that 
in Bohr’s model, along with two nonclassical quantization postulates, we also have the classical 
description of the electron as a particle that is subjected to the Coulomb force, and its motion must obey 
Newton’s laws of motion. The hydrogen atom, as an isolated system, must obey the laws of 
conservation of energy and momentum in the way we know from classical physics. Having this 
theoretical framework in mind, we are ready to proceed with our analysis. 


Electron Orbits 


To obtain the size r,, of the electron’s nth orbit and the electron’s speed v,, in it, we turn to Newtonian 
mechanics. As a charged particle, the electron experiences an electrostatic pull toward the positively 
charged nucleus in the center of its circular orbit. This electrostatic pull is the centripetal force that 
causes the electron to move in a circle around the nucleus. Therefore, the magnitude of centripetal force 
is identified with the magnitude of the electrostatic force: 


Equation: 


Mev 1 e? 


Tt, Amey v2’ 
Here, e denotes the value of the elementary charge. The negative electron and positive proton have the 
same value of charge, 


la| =e. 


When [link] is combined with the first quantization condition given by [link], we can solve for the 
speed, v,,, and for the radius, r, : 


Equation: 
1 el 
Un = = 
""" Aneg Aon 
Equation: 
h2 
Tn = ATE n’. 
Mee 


Note that these results tell us that the electron’s speed as well as the radius of its orbit depend only on 
the index n that enumerates the orbit because all other quantities in the preceding equations are 
fundamental constants. We see from [link] that the size of the orbit grows as the square of n. This 
means that the second orbit is four times as large as the first orbit, and the third orbit is nine times as 
large as the first orbit, and so on. We also see from [link] that the electron’s speed in the orbit decreases 
as the orbit size increases. The electron’s speed is largest in the first Bohr orbit, for m = 1, which is the 
orbit closest to the nucleus. The radius of the first Bohr orbit is called the Bohr radius of hydrogen, 
denoted as apo. Its value is obtained by setting n = 1 in [link]: 


Note: 
Equation: 


2 
— 5.29 x 101m = 0.529 A. 


ao = 4 €o 5 
Mee 


We can substitute @g in [link] to express the radius of the nth orbit in terms of ag : 


Note: 
Equation: 


i aon’. 


This result means that the electron orbits in hydrogen atom are quantized because the orbital radius 
takes on only specific values of ag, 4a9, 9ao, 16a9, . .. given by [link], and no other values are allowed. 


Electron Energies 


The total energy E,, of an electron in the nth orbit is the sum of its kinetic energy K, and its 
electrostatic potential energy U,,. Utilizing [link], we find that 
Equation: 


Ky, = | = : mee! 1 . 
2 32n7%62 fi? on? 


Recall that the electrostatic potential energy of interaction between two charges q; and qo that are 
separated by a distance rj2 is (1 / 47€0)qig2 /riz. Here, qi = +e is the charge of the nucleus in the 
hydrogen atom (the charge of the proton), gz = —e is the charge of the electron and rjz = r,, is the 
radius of the nth orbit. Now we use [link] to find the potential energy of the electron: 

Equation: 


Vy. = 1 e? _ 1 m-e* 1 
"" Aneg Th 16m? A? n2- 


The total energy of the electron is the sum of [link] and [link]: 
Equation: 


1 m-e* 1 
327762 R22 2° 


Note that the energy depends only on the index n because the remaining symbols in [link] are physical 
constants. The value of the constant factor in [link] is 


Note: 
Equation: 


1 eet 1 meet 
= me = OE 8217 x 10-8 J = 13.6 eV. 
B2me, hi? 82h? 


It is convenient to express the electron’s energy in the nth orbit in terms of this energy, as 


Note: 
Equation: 


i hee 


Now we can see that the electron energies in the hydrogen atom are quantized because they can have 
only discrete values of —E, —E /4, —Eo /9, —Eo /16,... given by [link], and no other energy 
values are allowed. This set of allowed electron energies is called the energy spectrum of hydrogen 
([link]). The index n that enumerates energy levels in Bohr’s model is called the energy quantum 
number. We identify the energy of the electron inside the hydrogen atom with the energy of the 
hydrogen atom. Note that the smallest value of energy is obtained for n = 1, so the hydrogen atom 
cannot have energy smaller than that. This smallest value of the electron energy in the hydrogen atom is 
called the ground state energy of the hydrogen atom and its value is 


Note: 
Equation: 


Ey = —Eo = —13.6 eV. 


The hydrogen atom may have other energies that are higher than the ground state. These higher energy 
states are known as excited energy states of a hydrogen atom. 


There is only one ground state, but there are infinitely many excited states because there are infinitely 
many values of n in [link]. We say that the electron is in the “first exited state” when its energy is Ez 
(when n = 2), the second excited state when its energy is #3 (when n = 3) and, in general, in the nth 
exited state when its energy is /,,41. There is no highest-of-all excited state; however, there is a limit to 
the sequence of excited states. If we keep increasing n in [link], we find that the limit is 


—lim Ep /n? = 0. In this limit, the electron is no longer bound to the nucleus but becomes a free 
n—-co 


electron. An electron remains bound in the hydrogen atom as long as its energy is negative. An electron 
that orbits the nucleus in the first Bohr orbit, closest to the nucleus, is in the ground state, where its 
energy has the smallest value. In the ground state, the electron is most strongly bound to the nucleus 
and its energy is given by [link]. If we want to remove this electron from the atom, we must supply it 
with enough energy, /,, to at least balance out its ground state energy EF : 

Equation: 


Both =0=> En =—-Ek, = —(—Ep) = Ky = 13.6eV. 


The energy that is needed to remove the electron from the atom is called the ionization energy. The 
ionization energy /,, that is needed to remove the electron from the first Bohr orbit is called the 
ionization limit of the hydrogen atom. The ionization limit in [link] that we obtain in Bohr’s model 
agrees with experimental value. 


Lyman series 


The energy spectrum of the hydrogen atom. Energy 
levels (horizontal lines) represent the bound states of an 
electron in the atom. There is only one ground state, 
mn = 1, and infinite quantized excited states. The states 
are enumerated by the quantum number 
n= 1,2,3,4,.... Vertical lines illustrate the allowed 
electron transitions between the states. Downward 
arrows illustrate transitions with an emission of a 
photon with a wavelength in the indicated spectral 
band. 


Spectral Emission Lines of Hydrogen 


To obtain the wavelengths of the emitted radiation when an electron makes a transition from the nth 
orbit to the mth orbit, we use the second of Bohr’s quantization conditions and [link] for energies. The 
emission of energy from the atom can occur only when an electron makes a transition from an excited 
state to a lower-energy state. In the course of such a transition, the emitted photon carries away the 
difference of energies between the states involved in the transition. The transition cannot go in the other 
direction because the energy of a photon cannot be negative, which means that for emission we must 
have F,, > Em andn > m. Therefore, the third of Bohr’s postulates gives 

Equation: 


hf = 


Now we express the photon’s energy in terms of its wavelength, hf = hc/X, and divide both sides of 
[link] by he. The result is 

1 Ef 1 1 

A he \m? nn?) 


Equation: 
The value of the constant in this equation is 
Equation: 


oS oe = 1.097 x 107. 
he = (4.136 x 10-eV -s)(2.997 x 10°m/s) m 


This value is exactly the Rydberg constant Ry in the Rydberg heuristic formula [link]. In fact, [link] is 
identical to the Rydberg formula, because for a given m, we haven = m-+1,m-+ 2,.... In this way, 
the Bohr quantum model of the hydrogen atom allows us to derive the experimental Rydberg constant 
from first principles and to express it in terms of fundamental constants. Transitions between the 
allowed electron orbits are illustrated in [link]. 


We can repeat the same steps that led to [link] to obtain the wavelength of the absorbed radiation; this 
again gives [link] but this time for the positions of absorption lines in the absorption spectrum of 
hydrogen. The only difference is that for absorption, the quantum number m is the index of the orbit 
occupied by the electron before the transition (lower-energy orbit) and the quantum number n is the 
index of the orbit to which the electron makes the transition (higher-energy orbit). The difference 
between the electron energies in these two orbits is the energy of the absorbed photon. 


Example: 

Size and Ionization Energy of the Hydrogen Atom in an Excited State 

If a hydrogen atom in the ground state absorbs a 93.7-nm photon, corresponding to a transition line in 
the Lyman series, how does this affect the atom’s energy and size? How much energy is needed to 
ionize the atom when it is in this excited state? Give your answers in absolute units, and relative to the 
ground state. 


Strategy 

Before the absorption, the atom is in its ground state. This means that the electron transition takes 
place from the orbit m = 1 to some higher nth orbit. First, we must determine n for the absorbed 
wavelength AX = 93.7 nm. Then, we can use [link] to find the energy FE, of the excited state and its 
ionization energy EF. .n, and use [link] to find the radius r,, of the atom in the excited state. To 
estimate n, we use [link]. 

Solution 

Substitute m = 1 and A = 93.7 nm in [link] and solve for n. You should not expect to obtain a perfect 
integer answer because of rounding errors, but your answer will be close to an integer, and you can 
estimate n by taking the integral part of your answer: 

Equation: 


1 1 1 1 1 
= Sle = | =n= = 
Xr 12 n2 1 1 1 1 
~ Ru (93.7 x 10-°m)(1.097 x 10’m-!) 


The radius of the n = 6 orbit is 
Equation: 


= 6.07 > n=6. 


Tn = aon” = ag6” = 36a9 = 36(0.529 x 10°'°m) = 19.04 x 10°'m ~ 19.0A. 


Thus, after absorbing the 93.7-nm photon, the size of the hydrogen atom in the excited n = 6 state is 
36 times larger than before the absorption, when the atom was in the ground state. The energy of the 
fifth excited state (n = 6) is: 

Equation: 


Eo Eo Eo 13.6 eV 
= = —— = ——___ & 0.378 eV. 

2 62 36 36 

After absorbing the 93.7-nm photon, the energy of the hydrogen atom is larger than it was before the 

absorption. Ionization of the atom when it is in the fifth excited state (n = 6) requites 36 times less 

energy than is needed when the atom is in the ground state: 

Equation: 


Ex = —E¢ = —(—0.378 eV) = 0.378 eV. 


Significance 

We can analyze any spectral line in the spectrum of hydrogen in the same way. Thus, the experimental 
measurements of spectral lines provide us with information about the atomic structure of the hydrogen 
atom. 


Note: 
Exercise: 


Problem: 
Check Your Understanding When an electron in a hydrogen atom is in the first excited state, 


what prediction does the Bohr model give about its orbital speed and kinetic energy? What is the 
magnitude of its orbital angular momentum? 


Solution: 


v2 =1.1 x 10°m/s & 0.0036c; Le = 2h Kz =3.4eV 


Bohr’s model of the hydrogen atom also correctly predicts the spectra of some hydrogen-like ions. 
Hydrogen-like ions are atoms of elements with an atomic number Z larger than one (Z = 1 for 
hydrogen) but with all electrons removed except one. For example, an electrically neutral helium atom 
has an atomic number Z = 2. This means it has two electrons orbiting the nucleus with a charge of 

q = +Ze. When one of the orbiting electrons is removed from the helium atom (we say, when the 
helium atom is singly ionized), what remains is a hydrogen-like atomic structure where the remaining 
electron orbits the nucleus with a charge of gq = +Ze. This type of situation is described by the Bohr 
model. Assuming that the charge of the nucleus is not +e but + Ze, we can repeat all steps, beginning 
with [link], to obtain the results for a hydrogen-like ion: 


Note: 
Equation: 
ao 
Thr = vA : 

where ag is the Bohr orbit of hydrogen, and 
Note: 
Equation: 

1 

E, = —Z°Ey— 
n 


where £7 is the ionization limit of a hydrogen atom. These equations are good approximations as long 
as the atomic number Z is not too large. 


The Bohr model is important because it was the first model to postulate the quantization of electron 
orbits in atoms. Thus, it represents an early quantum theory that gave a start to developing modern 
quantum theory. It introduced the concept of a quantum number to describe atomic states. The 
limitation of the early quantum theory is that it cannot describe atoms in which the number of electrons 
orbiting the nucleus is larger than one. The Bohr model of hydrogen is a semi-classical model because 
it combines the classical concept of electron orbits with the new concept of quantization. The 
remarkable success of this model prompted many physicists to seek an explanation for why such a 
model should work at all, and to seek an understanding of the physics behind the postulates of early 
quantum theory. This search brought about the onset of an entirely new concept of “matter waves.” 


Summary 


e Positions of absorption and emission lines in the spectrum of atomic hydrogen are given by the 
experimental Rydberg formula. Classical physics cannot explain the spectrum of atomic hydrogen. 

e The Bohr model of hydrogen was the first model of atomic structure to correctly explain the 
radiation spectra of atomic hydrogen. It was preceded by the Rutherford nuclear model of the 
atom. In Rutherford’s model, an atom consists of a positively charged point-like nucleus that 
contains almost the entire mass of the atom and of negative electrons that are located far away 
from the nucleus. 

¢ Bohr’s model of the hydrogen atom is based on three postulates: (1) an electron moves around the 
nucleus in a circular orbit, (2) an electron’s angular momentum in the orbit is quantized, and (3) 
the change in an electron’s energy as it makes a quantum jump from one orbit to another is always 
accompanied by the emission or absorption of a photon. Bohr’s model is semi-classical because it 
combines the classical concept of electron orbit (postulate 1) with the new concept of quantization 
(postulates 2 and 3). 

e Bohr’s model of the hydrogen atom explains the emission and absorption spectra of atomic 
hydrogen and hydrogen-like ions with low atomic numbers. It was the first model to introduce the 
concept of a quantum number to describe atomic states and to postulate quantization of electron 
orbits in the atom. Bohr’s model is an important step in the development of quantum mechanics, 
which deals with many-electron atoms. 


Conceptual Questions 


Exercise: 
Problem: 
Explain why the patterns of bright emission spectral lines have an identical spectral position to the 
pattern of dark absorption spectral lines for a given gaseous element. 


Exercise: 


Problem: Do the various spectral lines of the hydrogen atom overlap? 


Solution: 


no 
Exercise: 
Problem: 
The Balmer series for hydrogen was discovered before either the Lyman or the Paschen series. 
Why? 
Exercise: 
Problem: 
When the absorption spectrum of hydrogen at room temperature is analyzed, absorption lines for 


the Lyman series are found, but none are found for the Balmer series. What does this tell us about 
the energy state of most hydrogen atoms at room temperature? 


Solution: 


They are at ground state. 

Exercise: 
Problem: 
Hydrogen accounts for about 75% by mass of the matter at the surfaces of most stars. However, 
the absorption lines of hydrogen are strongest (of highest intensity) in the spectra of stars with a 
surface temperature of about 9000 K. They are weaker in the sun spectrum and are essentially 
nonexistent in very hot (temperatures above 25,000 K) or rather cool (temperatures below 3500 K) 


stars. Speculate as to why surface temperature affects the hydrogen absorption lines that we 
observe. 


Exercise: 
Problem: 


Discuss the similarities and differences between Thomson’s model of the hydrogen atom and 
Bohr’s model of the hydrogen atom. 


Solution: 


Answers may vary 
Exercise: 
Problem: 
Discuss the way in which Thomson’s model is nonphysical. Support your argument with 
experimental evidence. 
Exercise: 
Problem: 


If, in a hydrogen atom, an electron moves to an orbit with a larger radius, does the energy of the 
hydrogen atom increase or decrease? 


Solution: 


increase 
Exercise: 
Problem: 
How is the energy conserved when an atom makes a transition from a higher to a lower energy 
state? 
Exercise: 
Problem: 


Suppose an electron in a hydrogen atom makes a transition from the (n+1)th orbit to the nth orbit. 
Is the wavelength of the emitted photon longer for larger values of n, or for smaller values of n? 


Solution: 


for larger n 


Exercise: 


Problem: Discuss why the allowed energies of the hydrogen atom are negative. 


Exercise: 


Problem: Can a hydrogen atom absorb a photon whose energy is greater than 13.6 eV? 


Solution: 
Yes, the excess of 13.6 eV will become kinetic energy of a free electron. 


Exercise: 


Problem: Why can you see through glass but not through wood? 


Exercise: 


Problem: Do gravitational forces have a significant effect on atomic energy levels? 


Solution: 
no 


Exercise: 


Problem: Show that Planck’s constant has the dimensions of angular momentum. 


Problems 


Exercise: 


Problem: 


Calculate the wavelength of the first line in the Lyman series and show that this line lies in the 
ultraviolet part of the spectrum. 


Solution: 


121.5 nm 
Exercise: 
Problem: 
Calculate the wavelength of the fifth line in the Lyman series and show that this line lies in the 
ultraviolet part of the spectrum. 


Exercise: 


Problem: 


Calculate the energy changes corresponding to the transitions of the hydrogen atom: (a) from 
n= 3ton = 4; (b) fromn = 2 ton = 1; and(c) fromn = 3ton=o. 


Solution: 


a. 0.661 eV; b. —10.2 eV; c. 1.511 eV 


Exercise: 


Problem: Determine the wavelength of the third Balmer line (transition from n = 5 to n = 2). 
Exercise: 
Problem: 


What is the frequency of the photon absorbed when the hydrogen atom makes the transition from 
the ground state to the n = 4 state? 


Solution: 


3038 THz 
Exercise: 
Problem: 
When a hydrogen atom is in its ground state, what are the shortest and longest wavelengths of the 
photons it can absorb without being ionized? 
Exercise: 
Problem: 


When a hydrogen atom is in its third excided state, what are the shortest and longest wavelengths 
of the photons it can emit? 


Solution: 


97.33 nm 
Exercise: 
Problem: 
What is the longest wavelength that light can have if it is to be capable of ionizing the hydrogen 
atom in its ground state? 
Exercise: 
Problem: 


For an electron in a hydrogen atom in the n = 2 state, compute: (a) the angular momentum; (b) the 
kinetic energy; (c) the potential energy; and (d) the total energy. 


Solution: 


a. h/7; b. 3.4 eV; c. — 6.8 eV; d.- 3.4 eV 


Exercise: 


Problem: Find the ionization energy of a hydrogen atom in the fourth energy state. 
Exercise: 


Problem: 


It has been measured that it required 0.850 eV to remove an electron from the hydrogen atom. In 
what state was the atom before the ionization happened? 


Solution: 


n=A4 


Exercise: 


Problem: What is the radius of a hydrogen atom when the electron is in the first excited state? 
Exercise: 


Problem: 


Find the shortest wavelength in the Balmer series. In what part of the spectrum does this line lie? 
Solution: 


365 nm; UV 


Exercise: 


Problem: Show that the entire Paschen series lies in the infrared part of the spectrum. 
Exercise: 
Problem: 


Do the Balmer series and the Lyman series overlap? Why? Why not? (Hint: calculate the shortest 
Balmer line and the longest Lyman line.) 


Solution: 


no 

Exercise: 
Problem: 
(a) Which line in the Balmer series is the first one in the UV part of the spectrum? (b) How many 
Balmer lines lie in the visible part of the spectrum? (c) How many Balmer lines lie in the UV? 


Exercise: 


Problem: 


A 4.653-1m emission line of atomic hydrogen corresponds to transition between the states ns = 5 
and n;. Find n;. 


Solution: 


7 


Glossary 


absorption spectrum 
wavelengths of absorbed radiation by atoms and molecules 


a-particle 
doubly ionized helium atom 


a-ray 
beam of a-particles (alpha-particles) 


Balmer formula 
describes the emission spectrum of a hydrogen atom in the visible-light range 


Balmer series 
spectral lines corresponding to electron transitions to/from the n = 2 state of the hydrogen atom, 
described by the Balmer formula 


B-ray 
beam of electrons 


Bohr radius of hydrogen 
radius of the first Bohr’s orbit 


Bohr’s model of the hydrogen atom 
first quantum model to explain emission spectra of hydrogen 


Brackett series 
spectral lines corresponding to electron transitions to/from the n = 4 state 


emission spectrum 
wavelengths of emitted radiation by atoms and molecules 


energy spectrum of hydrogen 
set of allowed discrete energies of an electron in a hydrogen atom 


excited energy states of the H atom 
energy state other than the ground state 


Fraunhofer lines 
dark absorption lines in the continuum solar emission spectrum 


y-Tay 
beam of highly energetic photons 


ground state energy of the hydrogen atom 
energy of an electron in the first Bohr orbit of the hydrogen atom 


Humphreys series 
spectral lines corresponding to electron transitions to/from the n = 6 state 


hydrogen-like atom 
ionized atom with one electron remaining and nucleus with charge + Ze 


ionization energy 
energy needed to remove an electron from an atom 


ionization limit of the hydrogen atom 
ionization energy needed to remove an electron from the first Bohr orbit 


Lyman series 
spectral lines corresponding to electron transitions to/from the ground state 


nuclear model of the atom 
heavy positively charged nucleus at the center is surrounded by electrons, proposed by Rutherford 


Paschen series 
spectral lines corresponding to electron transitions to/from the n = 8 state 


Pfund series 
spectral lines corresponding to electron transitions to/from the n = 5 state 


postulates of Bohr’s model 
three assumptions that set a frame for Bohr’s model 


quantum number 
index that enumerates energy levels 


Rutherford’s gold foil experiment 
first experiment to demonstrate the existence of the atomic nucleus 


Rydberg constant for hydrogen 
physical constant in the Balmer formula 


Rydberg formula 
experimentally found positions of spectral lines of hydrogen atom 


De Broglie’s Matter Waves 
By the end of this section, you will be able to: 


e Describe de Broglie’s hypothesis of matter waves 

¢ Explain how the de Broglie’s hypothesis gives the rationale for the quantization of angular 
momentum in Bohr’s quantum theory of the hydrogen atom 

¢ Describe the Davisson—Germer experiment 

¢ Interpret de Broglie’s idea of matter waves and how they account for electron diffraction 
phenomena 


Compton’s formula established that an electromagnetic wave can behave like a particle of light 
when interacting with matter. In 1924, Louis de Broglie proposed a new speculative hypothesis 
that electrons and other particles of matter can behave like waves. Today, this idea is known as de 
Broglie’s hypothesis of matter waves. In 1926, De Broglie’s hypothesis, together with Bohr’s 
early quantum theory, led to the development of a new theory of wave quantum mechanics to 
describe the physics of atoms and subatomic particles. Quantum mechanics has paved the way for 
new engineering inventions and technologies, such as the laser and magnetic resonance imaging 
(MRI). These new technologies drive discoveries in other sciences such as biology and chemistry. 


According to de Broglie’s hypothesis, massless photons as well as massive particles must satisfy 
one common set of relations that connect the energy E with the frequency f, and the linear 
momentum p with the wavelength A. We have discussed these relations for photons in the context 
of Compton’s effect. We are recalling them now in a more general context. Any particle that has 
energy and momentum is a de Broglie wave of frequency f and wavelength X : 


Note: 
Equation: 
E=hf 
Note: 
Equation: 
h 
—— 
p 


Here, E and p are, respectively, the relativistic energy and the momentum of a particle. De 


Broglie’s relations are usually expressed in terms of the wave vector k, k = 27/ A, and the wave 
frequency w = 27f, as we usually do for waves: 
Equation: 


E = hw 


Equation: 


p= Ak. 


Wave theory tells us that a wave carries its energy with the group velocity. For matter waves, this 
group velocity is the velocity u of the particle. Identifying the energy E and momentum p of a 
particle with its relativistic energy mc? and its relativistic momentum mu, respectively, it follows 
from de Broglie relations that matter waves satisfy the following relation: 


Note: 
Equation: 


where 6 = u/c. When a particle is massless we have u = c and [link] becomes Af = c. 


Example: 

How Long Are de Broglie Matter Waves? 

Calculate the de Broglie wavelength of: (a) a 0.65-kg basketball thrown at a speed of 10 m/s, (b) a 
nonrelativistic electron with a kinetic energy of 1.0 eV, and (c) a relativistic electron with a kinetic 
energy of 108 keV. 

Strategy 

We use [link] to find the de Broglie wavelength. When the problem involves a nonrelativistic 
object moving with a nonrelativistic speed u, such as in (a) when 8 = u/c < 1, we use 
nonrelativistic momentum p. When the nonrelativistic approximation cannot be used, such as in 
(c), we must use the relativistic momentum p = mu = moyu = EoyG/c, where the rest mass 


energy of a particle is Ey = mc? and 7 is the Lorentz factor y = 1 / “i 1 — 82. The total energy 
E of a particle is given by [link] and the kinetic energy is K = E — Eo = (y — 1)Eo. When the 
kinetic energy is known, we can invert [link] to find the momentum 

p= (2 — E2)/c? = /K(K + 2Eo) /c and substitute in [link] to obtain 

Equation: 


oS he 


Pp maaan) 


Depending on the problem at hand, in this equation we can use the following values for hc: 
he = (6.626 x 10-*J-s)(2.998 x 10°m/s) = 1.986 x 10°? J-m=1.241eV- wm 
Solution 


a. For the basketball, the kinetic energy is 
Equation: 


K = mu? /2 = (0.65kg)(10 m/s)? /2 = 32.5 J 


and the rest mass energy is 
Equation: 


Ep = me? = (0.65 kg)(2.998 x 108 m/s)” = 5.84 x 10!°J. 


We see that K /(K + Eo) < land use p = mu = (0.65kg)(10 m/s) = 6.5J-s/m: 
Equation: 


h 6.626 x 10-*4J-s 


= = 1.02 x 107*4m. 
Pp 6.53 -s/m 


A= 


b. For the nonrelativistic electron, 
Equation: 


Ey = me? = (9.109 x 10-**kg)(2.998 x 108m/s)” = 511 keV 


and when K = 1.0 eV, we have K /(K + Ep) = (1/512) x 10°? <1, so we can use 
the nonrelativistic formula. However, it is simpler here to use [link]: 
Equation: 


h he 1.241 eV - um 


p> \/ K(K + 2Ep) 7 s/ (1.0 eV)[1.0 eV+2(511 keV)] 


N= = 1.23 nm. 


If we use nonrelativistic momentum, we obtain the same result because 1 eV is much smaller 
than the rest mass of the electron. 

. For a fast electron with K = 108 keV, relativistic effects cannot be neglected because its 
total energy is # = K + Eo = 108keV + 511 keV = 619 keV and K / FE = 108/619 is 
not negligible: 

Equation: 


() 


y= h he 1.241 eV - pm 


AAA _ = 3.55 pm. 
P  ./K(K+2E)) —_\/108 keV[108 keV + 2(511 keV) 


Significance 

We see from these estimates that De Broglie’s wavelengths of macroscopic objects such as a ball 
are immeasurably small. Therefore, even if they exist, they are not detectable and do not affect the 
motion of macroscopic objects. 


Note: 
Exercise: 


Problem: 


Check Your Understanding What is de Broglie’s wavelength of a nonrelativistic proton 
with a kinetic energy of 1.0 eV? 


Solution: 


29 pm 


Using the concept of the electron matter wave, de Broglie provided a rationale for the quantization 
of the electron’s angular momentum in the hydrogen atom, which was postulated in Bohr’s 
quantum theory. The physical explanation for the first Bohr quantization condition comes naturally 
when we assume that an electron in a hydrogen atom behaves not like a particle but like a wave. 
To see it clearly, imagine a stretched guitar string that is clamped at both ends and vibrates in one 
of its normal modes. If the length of the string is / ([link]), the wavelengths of these vibrations 
cannot be arbitrary but must be such that an integer k number of half-wavelengths 2 /2 fit exactly 
on the distance / between the ends. This is the condition / = kA /2 for a standing wave on a string. 
Now suppose that instead of having the string clamped at the walls, we bend its length into a circle 
and fasten its ends to each other. This produces a circular string that vibrates in normal modes, 
satisfying the same standing-wave condition, but the number of half-wavelengths must now be an 
even number k, k = 2n, and the length / is now connected to the radius r,, of the circle. This 
means that the radii are not arbitrary but must satisfy the following standing-wave condition: 
Equation: 


2mr, = 2n—. 
Tr oS 


If an electron in the nth Bohr orbit moves as a wave, by [link] its wavelength must be equal to 

A = 2rr,,/n. Assuming that [link] is valid, the electron wave of this wavelength corresponds to 
the electron’s linear momentum, p = h/A = nh/(2ar,) = nh/ry. Ina circular orbit, therefore, 
the electron’s angular momentum must be 

Equation: 


nh 
Ly = TnPp = Tn — = nh. 
Tn 


This equation is the first of Bohr’s quantization conditions, given by [link]. Providing a physical 
explanation for Bohr’s quantization condition is a convincing theoretical argument for the 
existence of matter waves. 


(a) (b) 


Standing-wave pattern: (a) a stretched string clamped at the walls; (b) an electron wave 
trapped in the third Bohr orbit in the hydrogen atom. 


Example: 

The Electron Wave in the Ground State of Hydrogen 

Find the de Broglie wavelength of an electron in the ground state of hydrogen. 

Strategy 

We combine the first quantization condition in [link] with [link] and use [link] for the first Bohr 
radius with n = 1. 

Solution 

When n = 1 andr, = ap = 0.529 A, the Bohr quantization condition gives 

agp = 1-h > p= h/apo. The electron wavelength is: 

Equation: 


A=h/p=h/h/ag = 2ma9 = 277(0.529 A) = 3.324 A. 


Significance 
We obtain the same result when we use [link] directly. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Find the de Broglie wavelength of an electron in the third 
excited state of hydrogen. 


Solution: 


N= Wray = 2(3.324 A) = 6.648 A 


Experimental confirmation of matter waves came in 1927 when C. Davisson and L. Germer 
performed a series of electron-scattering experiments that clearly showed that electrons do behave 
like waves. Davisson and Germer did not set up their experiment to confirm de Broglie’s 
hypothesis: The confirmation came as a byproduct of their routine experimental studies of metal 
surfaces under electron bombardment. 


In the particular experiment that provided the very first evidence of electron waves (known today 
as the Davisson—Germer experiment), they studied a surface of nickel. Their nickel sample was 
specially prepared in a high-temperature oven to change its usual polycrystalline structure to a 
form in which large single-crystal domains occupy the volume. [link] shows the experimental 
setup. Thermal electrons are released from a heated element (usually made of tungsten) in the 
electron gun and accelerated through a potential difference AV, becoming a well-collimated beam 
of electrons produced by an electron gun. The kinetic energy K of the electrons is adjusted by 
selecting a value of the potential difference in the electron gun. This produces a beam of electrons 
with a set value of linear momentum, in accordance with the conservation of energy: 

Equation: 


2 
eAV = K = > => p= V2meAV. 
m 


The electron beam is incident on the nickel sample in the direction normal to its surface. At the 
surface, it scatters in various directions. The intensity of the beam scattered in a selected direction 
y is measured by a highly sensitive detector. The detector’s angular position with respect to the 
direction of the incident beam can be varied from y = 0° to y = 90°. The entire setup is enclosed 
in a vacuum chamber to prevent electron collisions with air molecules, as such thermal collisions 
would change the electrons’ kinetic energy and are not desirable. 
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Schematics of the experimental setup of the 
Davisson—Germer diffraction experiment. A well- 
collimated beam of electrons is scattered off the 
nickel target. The kinetic energy of electrons in the 
incident beam is selected by adjusting a variable 
potential, AV, in the electron gun. Intensity of the 


scattered electron beam is measured for a range of 
scattering angles y, whereas the distance between 
the detector and the target does not change. 


When the nickel target has a polycrystalline form with many randomly oriented microscopic 
crystals, the incident electrons scatter off its surface in various random directions. As a result, the 
intensity of the scattered electron beam is much the same in any direction, resembling a diffuse 
reflection of light from a porous surface. However, when the nickel target has a regular crystalline 
structure, the intensity of the scattered electron beam shows a clear maximum at a specific angle 
and the results show a clear diffraction pattern (see [link]). Similar diffraction patterns formed by 
X-rays scattered by various crystalline solids were studied in 1912 by father-and-son physicists 
William H. Bragg and William L. Bragg. The Bragg law in X-ray crystallography provides a 
connection between the wavelength A of the radiation incident on a crystalline lattice, the lattice 
spacing, and the position of the interference maximum in the diffracted radiation (see Diffraction). 


The lattice spacing of the Davisson—Germer target, determined with X-ray crystallography, was 
measured to be a = 2.15 A. Unlike X-ray crystallography in which X-rays penetrate the sample, 
in the original Davisson—Germer experiment, only the surface atoms interact with the incident 
electron beam. For the surface diffraction, the maximum intensity of the reflected electron beam is 
observed for scattering angles that satisfy the condition nA = asiny (see [link]). The first-order 
maximum (for n = 1) is measured at a scattering angle of y ~ 50° at AV = 54V, which gives 
the wavelength of the incident radiation as \ = (2.15 A)sin50° = 1.64 A. On the other hand, a 
54-V potential accelerates the incident electrons to kinetic energies of K = 54 eV. Their 
momentum, calculated from [link], is p = 2.478 x 10-eV -s /m. When we substitute this 
result in [link], the de Broglie wavelength is obtained as 

Equation: 


4.1 10-beV - 
jae Bn 36 x 10°°eV-s 167A. 
Pp 2.478 x 10° °eV-s/m 


The same result is obtained when we use K = 54 eV in [link]. The proximity of this theoretical 
result to the Davisson—Germer experimental value of \ = 1.64 A is a convincing argument for the 
existence of de Broglie matter waves. 
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The experimental results of electron 
diffraction on a nickel target for the 
accelerating potential in the electron 
gun of about AV = 54V : The 
intensity maximum is registered at the 
scattering angle of about y = 50°. 
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In the surface diffraction of a 
monochromatic electromagnetic 
wave on a crystalline lattice 
structure, the in-phase incident 
beams are reflected from atoms on 
the surface. A ray reflected from the 
left atom travels an additional 
distance D = asiny to the detector, 
where a is the lattice spacing. The 
reflected beams remain in-phase 
when D is an integer multiple of 
their wavelength X. The intensity of 


the reflected waves has pronounced 
maxima for angles y satisfying 
nA = asingy. 


Diffraction lines measured with low-energy electrons, such as those used in the Davisson—Germer 
experiment, are quite broad (see [link]) because the incident electrons are scattered only from the 
surface. The resolution of diffraction images greatly improves when a higher-energy electron beam 
passes through a thin metal foil. This occurs because the diffraction image is created by scattering 
off many crystalline planes inside the volume, and the maxima produced in scattering at Bragg 
angles are sharp (see [link]). 


(a) (b) 


Diffraction patterns obtained in scattering on a crystalline solid: (a) with X-rays, and (b) with 
electrons. The observed pattern reflects the symmetry of the crystalline structure of the 
sample. 


Since the work of Davisson and Germer, de Broglie’s hypothesis has been extensively tested with 
various experimental techniques, and the existence of de Broglie waves has been confirmed for 
numerous elementary particles. Neutrons have been used in scattering experiments to determine 
crystalline structures of solids from interference patterns formed by neutron matter waves. The 
neutron has zero charge and its mass is comparable with the mass of a positively charged proton. 
Both neutrons and protons can be seen as matter waves. Therefore, the property of being a matter 
wave is not specific to electrically charged particles but is true of all particles in motion. Matter 
waves of molecules as large as carbon C9 have been measured. All physical objects, small or 
large, have an associated matter wave as long as they remain in motion. The universal character of 
de Broglie matter waves is firmly established. 


Example: 

Neutron Scattering 

Suppose that a neutron beam is used in a diffraction experiment on a typical crystalline solid. 
Estimate the kinetic energy of a neutron (in eV) in the neutron beam and compare it with kinetic 
energy of an ideal gas in equilibrium at room temperature. 

Strategy 

We assume that a typical crystal spacing a is of the order of 1.0 A. To observe a diffraction pattern 
on such a lattice, the neutron wavelength A must be on the same order of magnitude as the lattice 
spacing. We use [link] to find the momentum p and kinetic energy K. To compare this energy with 
the energy Er of ideal gas in equilibrium at room temperature T’ = 300K, we use the relation 

K =*;pkpT, where kp = 8.62 x 10-°eV /K is the Boltzmann constant. 

Solution 

We evaluate pc to compare it with the neutron’s rest mass energy Hyp = 940 MeV : 

Equation: 


he 1.241 x 10°°eV-m 


= 12.41 keV. 
A 107-1°m i 


2 = 
——— o— 
Pp X Pp 
We see that p?c? << E? so K < KE, and we can use the nonrelativistic kinetic energy: 
Equation: 
2 
p fe (6.63 x 10-*4J -s) 


— = = Se = 59? 10 OT = 827 meV. 
2mn  2d2mn (2 - x _:10-70m?)(1.66 x 10-?’kg) 


Kinetic energy of ideal gas in equilibrium at 300 K is: 
Equation: 


3 3 
Kr = a keT = 3 (8-62 x 10-°eV /K)(300K) = 38.8 MeV. 


We see that these energies are of the same order of magnitude. 

Significance 

Neutrons with energies in this range, which is typical for an ideal gas at room temperature, are 
called “thermal neutrons.” 


Example: 

Wavelength of a Relativistic Proton 

In a supercollider at CERN, protons can be accelerated to velocities of 0.75c. What are their de 
Broglie wavelengths at this speed? What are their kinetic energies? 

Strategy 

The rest mass energy of a proton is 


Ep = moc? = (1.672 x 10-2"kg)(2.998 x 108m/s)” = 938 MeV. When the proton’s 


velocity is known, we have 6 = 0.75 and By = 0.75/V1— 0.757 = 1.714. We obtain the 
wavelength A and kinetic energy K from relativistic relations. 
Solution 


Equation: 


ae ia eee ee 
p pe  ByE 1.714(938 MeV) 
Equation: 
K = Ey(y— 1) = 938 MeV(1/ V1 — 0.75? — 1) = 480.1 MeV 
Significance 


Notice that because a proton is 1835 times more massive than an electron, if this experiment were 
performed with electrons, a simple rescaling of these results would give us the electron’s 
wavelength of (1835)0.77fm = 1.4 pm and its kinetic energy of 

480.1 MeV /1835 = 261.6 keV. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Find the de Broglie wavelength and kinetic energy of a free 
electron that travels at a speed of 0.75c. 


Solution: 


A = 2.14pm; K = 261.56 keV 


Summary 


¢ De Broglie’s hypothesis of matter waves postulates that any particle of matter that has linear 
momentum is also a wave. The wavelength of a matter wave associated with a particle is 
inversely proportional to the magnitude of the particle’s linear momentum. The speed of the 
matter wave is the speed of the particle. 

¢ De Broglie’s concept of the electron matter wave provides a rationale for the quantization of 
the electron’s angular momentum in Bohr’s model of the hydrogen atom. 

¢ In the Davisson—Germer experiment, electrons are scattered off a crystalline nickel surface. 
Diffraction patterns of electron matter waves are observed. They are the evidence for the 
existence of matter waves. Matter waves are observed in diffraction experiments with various 
particles. 


Conceptual Questions 


Exercise: 


Problem: 


Which type of radiation is most suitable for the observation of diffraction patterns on 
crystalline solids; radio waves, visible light, or X-rays? Explain. 


Solution: 


X-rays, best resolving power 
Exercise: 
Problem: 
Speculate as to how the diffraction patterns of a typical crystal would be affected if y-rays 
were used instead of X-rays. 
Exercise: 
Problem: 


If an electron and a proton are traveling at the same speed, which one has the shorter de 
Broglie wavelength? 


Solution: 
proton 


Exercise: 


Problem: If a particle is accelerating, how does this affect its de Broglie wavelength? 
Exercise: 


Problem: 

Why is the wave-like nature of matter not observed every day for macroscopic objects? 
Solution: 

negligibly small de Broglie’s wavelengths 


Exercise: 


Problem: What is the wavelength of a neutron at rest? Explain. 
Exercise: 


Problem: 


Why does the setup of Davisson—Germer experiment need to be enclosed in a vacuum 
chamber? Discuss what result you expect when the chamber is not evacuated. 


Solution: 


to avoid collisions with air molecules 


Problems 


Exercise: 


Problem: At what velocity will an electron have a wavelength of 1.00 m? 
Exercise: 


Problem: 
What is the de Broglie wavelength of an electron travelling at a speed of 5.0 x 10°m /s? 


Solution: 
145.5 pm 


Exercise: 


Problem: 


What is the de Broglie wavelength of an electron that is accelerated from rest through a 
potential difference of 20 keV? 


Exercise: 


Problem: 
What is the de Broglie wavelength of a proton whose kinetic energy is 2.0 MeV? 10.0 MeV? 


Solution: 
20 fm; 9 fm 


Exercise: 


Problem: 


What is the de Broglie wavelength of a 10-kg football player running at a speed of 8.0 m/s? 
Exercise: 

Problem: 

(a) What is the energy of an electron whose de Broglie wavelength is that of a photon of 


yellow light with wavelength 590 nm? (b) What is the de Broglie wavelength of an electron 
whose energy is that of the photon of yellow light? 


Solution: 


a. 2.103 eV; b. 0.846 nm 


Exercise: 


Problem: 


The de Broglie wavelength of a neutron is 0.01 nm. What is the speed and energy of this 
neutron? 


Exercise: 


Problem: What is the wavelength of an electron that is moving at a 3% of the speed of light? 


Solution: 


80.9 pm 
Exercise: 


Problem: 


At what velocity does a proton have a 6.0-fm wavelength (about the size of a nucleus)? Give 
your answer in units of c. 


Exercise: 
Problem: What is the velocity of a 0.400-kg billiard ball if its wavelength is 7.50 fm? 


Solution: 


2.21 x 107'°m/s 
Exercise: 


Problem: 
Find the wavelength of a proton that is moving at 1.00% of the speed of light (when 
8 = 0.01). 

Glossary 


Davisson—Germer experiment 
historically first electron-diffraction experiment that revealed electron waves 


de Broglie’s hypothesis of matter waves 
particles of matter can behave like waves 


de Broglie wave 
matter wave associated with any object that has mass and momentum 


group velocity 
velocity of a wave, energy travels with the group velocity 


wave quantum mechanics 
theory that explains the physics of atoms and subatomic particles 


Wave-Particle Duality 
By the end of this section, you will be able to: 


e Identify phenomena in which electromagnetic waves behave like a beam of photons and 
particles behave like waves 

e Describe the physics principles behind electron microscopy 

¢ Summarize the evolution of scientific thought that led to the development of quantum 
mechanics 


The energy of radiation detected by a radio-signal receiving antenna comes as the energy of an 
electromagnetic wave. The same energy of radiation detected by a photocurrent in the 
photoelectric effect comes as the energy of individual photon particles. Therefore, the question 
arises about the nature of electromagnetic radiation: Is a photon a wave or is it a particle? 
Similar questions may be asked about other known forms of energy. For example, an electron 
that forms part of an electric current in a circuit behaves like a particle moving in unison with 
other electrons inside the conductor. The same electron behaves as a wave when it passes 
through a solid crystalline structure and forms a diffraction image. Is an electron a wave or is it 
a particle? The same question can be extended to all particles of matter—elementary particles, 
as well as compound molecules—asking about their true physical nature. At our present state 
of knowledge, such questions about the true nature of things do not have conclusive answers. 
All we can say is that wave-particle duality exists in nature: Under some experimental 
conditions, a particle appears to act as a particle, and under different experimental conditions, a 
particle appears to act a wave. Conversely, under some physical circumstances electromagnetic 
radiation acts as a wave, and under other physical circumstances, radiation acts as a beam of 
photons. 


This dualistic interpretation is not a new physics concept brought about by specific discoveries 
in the twentieth century. It was already present in a debate between Isaac Newton and 
Christiaan Huygens about the nature of light, beginning in the year 1670. According to 
Newton, a beam of light is a collection of corpuscles of light. According to Huygens, light is a 
wave. The corpuscular hypothesis failed in 1803, when Thomas Young announced his double- 
slit interference experiment with light (see [link]), which firmly established light as a wave. 
In James Clerk Maxwell’s theory of electromagnetism (completed by the year 1873), light is an 
electromagnetic wave. Maxwell’s classical view of radiation as an electromagnetic wave is still 
valid today; however, it is unable to explain blackbody radiation and the photoelectric effect, 
where light acts as a beam of photons. 


Young’s double-slit experiment explains the 
interference of light by making an analogy with the 
interference of water waves. Two waves are generated 
at the positions of two slits in an opaque screen. The 
waves have the same wavelengths. They travel from 
their origins at the slits to the viewing screen placed to 
the right of the slits. The waves meet on the viewing 
screen. At the positions marked “Max” on the screen, 
the meeting waves are in-phase and the combined wave 
amplitude is enhanced. At positions marked “Min,” the 
combined wave amplitude is zero. For light, this 
mechanism creates a bright-and-dark fringe pattern on 
the viewing screen. 


A similar dichotomy existed in the interpretation of electricity. From Benjamin Franklin’s 
observations of electricity in 1751 until J.J. Thomson’s discovery of the electron in 1897, 
electric current was seen as a flow in a continuous electric medium. Within this theory of 
electric fluid, the present theory of electric circuits was developed, and electromagnetism and 
electromagnetic induction were discovered. Thomson’s experiment showed that the unit of 
negative electric charge (an electron) can travel in a vacuum without any medium to carry the 
charge around, as in electric circuits. This discovery changed the way in which electricity is 
understood today and gave the electron its particle status. In Bohr’s early quantum theory of the 


hydrogen atom, both the electron and the proton are particles of matter. Likewise, in the 
Compton scattering of X-rays on electrons, the electron is a particle. On the other hand, in 
electron-scattering experiments on crystalline structures, the electron behaves as a wave. 


A skeptic may raise a question that perhaps an electron might always be nothing more than a 
particle, and that the diffraction images obtained in electron-scattering experiments might be 
explained within some macroscopic model of a crystal and a macroscopic model of electrons 
coming at it like a rain of ping-pong balls. As a matter of fact, to investigate this question, we 
do not need a complex model of a crystal but just a couple of simple slits in a screen that is 
opaque to electrons. In other words, to gather convincing evidence about the nature of an 
electron, we need to repeat the Young double-slit experiment with electrons. If the electron is a 
wave, we should observe the formation of interference patterns typical for waves, such as those 
described in [link], even when electrons come through the slits one by one. However, if the 
electron is a not a wave but a particle, the interference fringes will not be formed. 


The very first double-slit experiment with a beam of electrons, performed by Claus Jénsson in 
Germany in 1961, demonstrated that a beam of electrons indeed forms an interference pattern, 
which means that electrons collectively behave as a wave. The first double-slit experiments 
with single electrons passing through the slits one-by-one were performed by Giulio Pozzi in 
1974 in Italy and by Akira Tonomura in 1989 in Japan. They show that interference fringes are 
formed gradually, even when electrons pass through the slits individually. This demonstrates 
conclusively that electron-diffraction images are formed because of the wave nature of 
electrons. The results seen in double-slit experiments with electrons are illustrated by the 
images of the interference pattern a [link]. E 


Computer-simulated interference fringes seen in the Young double-slit experiment with 
electrons. One pattern is gradually formed on the screen, regardless of whether the 
electrons come through the slits as a beam or individually one-by-one. 


Example: 

Double-Slit Experiment with Electrons 

In one experimental setup for studying interference patterns of electron waves, two slits are 
created in a gold-coated silicon membrane. Each slit is 62-nm wide and 4-t1m long, and the 
separation between the slits is 272 nm. The electron beam is created in an electron gun by 
heating a tungsten element and by accelerating the electrons across a 600-V potential. The 
beam is subsequently collimated using electromagnetic lenses, and the collimated beam of 
electrons is sent through the slits. Find the angular position of the first-order bright fringe on 
the viewing screen. 

Strategy 


Recall that the angular position 0 of the nth order bright fringe that is formed in Young’s two- 
slit interference pattern (discussed in a previous chapter) is related to the separation, d, 
between the slits and to the wavelength, X, of the incident light by the equation dsin@ = nX, 
where n = 0, +1, +2,.... The separation is given and is equal to d = 272 nm. For the first- 
order fringe, we take n = 1. The only thing we now need is the wavelength of the incident 
electron wave. 

Since the electron has been accelerated from rest across a potential difference of AV = 600V, 
its kinetic energy is K = eAV = 600 eV. The rest-mass energy of the electron is 

Eo = 511 keV. 

We compute its de Broglie wavelength as that of a nonrelativistic electron because its kinetic 
energy / is much smaller than its rest energy ky, K < Epo. 

Solution 

The electron’s wavelength is 

Equation: 


eee h — he 1.241 x 10°°eV-m 
Pp VQ2meK /2E)/?K V2Ex0K  4/2(511keV)(600eV) 


This A is used to obtain the position of the first bright fringe: 
Equation: 


= 0.050 nm. 


Sele ees 2 UE nee! en 
d 272 nm 


Significance 

Notice that this is also the angular resolution between two consecutive bright fringes up to 
about n = 1000. For example, between the zero-order fringe and the first-order fringe, 
between the first-order fringe and the second-order fringe, and so on. 


Note: 
Exercise: 


Problem: 


Check Your Understanding For the situation described in [link], find the angular 
position of the fifth-order bright fringe on the viewing screen. 


Solution: 


0.052° 


The wave-particle dual nature of matter particles and of radiation is a declaration of our 
inability to describe physical reality within one unified classical theory because separately 


neither a classical particle approach nor a classical wave approach can fully explain the 
observed phenomena. This limitation of the classical approach was realized by the year 1928, 
and a foundation for a new statistical theory, called quantum mechanics, was put in place by 
Bohr, Edwin Schrédinger, Werner Heisenberg, and Paul Dirac. Quantum mechanics takes de 
Broglie’s idea of matter waves to be the fundamental property of all particles and gives it a 
statistical interpretation. According to this interpretation, a wave that is associated with a 
particle carries information about the probable positions of the particle and about its other 
properties. A single particle is seen as a moving wave packet such as the one shown in [link]. 
We can intuitively sense from this example that if a particle is a wave packet, we will not be 
able to measure its exact position in the same sense as we cannot pinpoint a location of a wave 
packet in a vibrating guitar string. The uncertainty, Az, in measuring the particle’s position is 
connected to the uncertainty, Ap, in the simultaneous measuring of its linear momentum by 
Heisenberg’s uncertainty principle: 


Note: 
Equation: 


AzAp > of 


Heisenberg’s principle expresses the law of nature that, at the quantum level, our perception is 
limited. For example, if we know the exact position of a body (which means that Az = 0 in 
[link]) at the same time we cannot know its momentum, because then the uncertainty in its 
momentum becomes infinite (because Ap > 0.5% / Az in [link]). The Heisenberg 
uncertainty principle sets the limit on the precision of simultaneous measurements of position 
and momentum of a particle; it shows that the best precision we can obtain is when we have an 
equals sign (=) in [link], and we cannot do better than that, even with the best instruments of 
the future. Heisenberg’s principle is a consequence of the wave nature of particles. 


In this graphic, a particle is shown as a wave packet 


and its position does not have an exact value. 


We routinely use many electronic devices that exploit wave-particle duality without even 
realizing the sophistication of the physics underlying their operation. One example of a 
technology based on the particle properties of photons and electrons is a charge-coupled 
device, which is used for light detection in any instrumentation where high-quality digital data 
are required, such as in digital cameras or in medical sensors. An example in which the wave 
properties of electrons is exploited is an electron microscope. 


In 1931, physicist Ernst Ruska—building on the idea that magnetic fields can direct an electron 
beam just as lenses can direct a beam of light in an optical microscope—developed the first 
prototype of the electron microscope. This development originated the field of electron 
microscopy. In the transmission electron microscope (TEM), shown in [link], electrons are 
produced by a hot tungsten element and accelerated by a potential difference in an electron 
gun, which gives them up to 400 keV in kinetic energy. After leaving the electron gun, the 
electron beam is focused by electromagnetic lenses (a system of condensing lenses) and 
transmitted through a specimen sample to be viewed. The image of the sample is reconstructed 
from the transmitted electron beam. The magnified image may be viewed either directly on a 
fluorescent screen or indirectly by sending it, for example, to a digital camera or a computer 
monitor. The entire setup consisting of the electron gun, the lenses, the specimen, and the 
fluorescent screen are enclosed in a vacuum chamber to prevent the energy loss from the beam. 
Resolution of the TEM is limited only by spherical aberration (discussed in a previous chapter). 
Modern high-resolution models of a TEM can have resolving power greater than 0.5 A and 
magnifications higher than 50 million times. For comparison, the best resolving power obtained 
with light microscopy is currently about 97 nm. A limitation of the TEM is that the samples 
must be about 100-nm thick and biological samples require a special preparation involving 
chemical “fixing” to stabilize them for ultrathin slicing. 
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TEM: An electron beam produced by an electron gun is collimated by condenser 
lenses and passes through a specimen. The transmitted electrons are projected on a 
screen and the image is sent to a camera. (credit: modification of work by Dr. Graham 
Beards) 


Such limitations do not appear in the scanning electron microscope (SEM), which was invented 
by Manfred von Ardenne in 1937. In an SEM, a typical energy of the electron beam is up to 40 
keV and the beam is not transmitted through a sample but is scattered off its surface. Surface 
topography of the sample is reconstructed by analyzing back-scattered electrons, transmitted 
electrons, and the emitted radiation produced by electrons interacting with atoms in the sample. 
The resolving power of an SEM is better than 1 nm, and the magnification can be more than 
250 times better than that obtained with a light microscope. The samples scanned by an SEM 
can be as large as several centimeters but they must be specially prepared, depending on 
electrical properties of the sample. 


High magnifications of the TEM and SEM allow us to see individual molecules. High 


resolving powers of the TEM and SEM allow us to see fine details, such as those shown in the 
SEM micrograph of pollen at the beginning of this chapter ({link]). 


Example: 


Resolving Power of an Electron Microscope 

If a 1.0-pm electron beam of a TEM passes through a 2.0-wm circular opening, what is the 
angle between the two just-resolvable point sources for this microscope? 

Solution 

We can directly use a formula for the resolving power, A@, of a microscope (discussed in a 
previous chapter) when the wavelength of the incident radiation is 1 = 1.0 pm and the 
diameter of the aperture is D = 2.0m : 


Equation: 
nN 1.0 
Ao = 1.292 = 1.29-—P™ — 6.10 x 10-7rad = 3.50 x 10-®degree. 
D 2.0um 
Significance 


Note that if we used a conventional microscope with a 400-nm light, the resolving power 
would be only 14°, which means that all of the fine details in the image would be blurred. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Suppose that the diameter of the aperture in [link] is halved. 
How does it affect the resolving power? 


Solution: 


doubles it 


Summary 


e Wave-particle duality exists in nature: Under some experimental conditions, a particle acts 
as a particle; under other experimental conditions, a particle acts as a wave. Conversely, 
under some physical circumstances, electromagnetic radiation acts as a wave, and under 
other physical circumstances, radiation acts as a beam of photons. 

¢ Modern-era double-slit experiments with electrons demonstrated conclusively that 
electron-diffraction images are formed because of the wave nature of electrons. 

e The wave-particle dual nature of particles and of radiation has no classical explanation. 

¢ Quantum theory takes the wave property to be the fundamental property of all particles. A 
particle is seen as a moving wave packet. The wave nature of particles imposes a 
limitation on the simultaneous measurement of the particle’s position and momentum. 
Heisenberg’s uncertainty principle sets the limits on precision in such simultaneous 
measurements. 


e Wave-particle duality is exploited in many devices, such as charge-couple devices (used in 
digital cameras) or in the electron microscopy of the scanning electron microscope (SEM) 
and the transmission electron microscope (TEM). 


Key Equations 
Wien’s displacement law Amaxl' = 2.898 x 10-°m-K 
Stefan’s law P(T) = cAT* 
Planck’s constant h = 6.626 x 10-“4J-s = 4.136 x 10°"eV-s 
Energy quantum of radiation AE = hf 
Planck’s blackbody radiation law LGD = 2ahet sae 
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Conceptual Questions 


Exercise: 


Problem: 


Give an example of an experiment in which light behaves as waves. Give an example of 
an experiment in which light behaves as a stream of photons. 


Exercise: 


Problem: 


Discuss: How does the interference of water waves differ from the interference of 
electrons? How are they analogous? 


Solution: 
Answers may vary 


Exercise: 


Problem: Give at least one argument in support of the matter-wave hypothesis. 
Exercise: 

Problem: Give at least one argument in support of the particle-nature of radiation. 

Solution: 

Answers may vary 


Exercise: 


Problem: Explain the importance of the Young double-slit experiment. 
Exercise: 


Problem: 


Does the Heisenberg uncertainty principle allow a particle to be at rest in a designated 
region in space? 


Solution: 


yes 


Exercise: 


Problem: Can the de Broglie wavelength of a particle be known exactly? 


Exercise: 


Problem: 


Do the photons of red light produce better resolution in a microscope than blue light 
photons? Explain. 


Solution: 
yes 


Exercise: 


Problem: Discuss the main difference between an SEM and a TEM. 


Problems 


Exercise: 


Problem: 


An AM radio transmitter radiates 500 kW at a frequency of 760 kHz. How many photons 
per second does the emitter emit? 


Solution: 


9.929 x 10°? 
Exercise: 


Problem: 


Find the Lorentz factor y and de Broglie’s wavelength for a 50-GeV electron in a particle 
accelerator. 


Exercise: 


Problem: 


Find the Lorentz factor 7 and de Broglie’s wavelength for a 1.0-TeV proton in a particle 
accelerator. 


Solution: 


+ = 1060; 0.00124 fm 


Exercise: 


Problem: What is the kinetic energy of a 0.01-nm electron ina TEM? 


Exercise: 


Problem: 


If electron is to be diffracted significantly by a crystal, its wavelength must be about equal 
to the spacing, d, of crystalline planes. Assuming d = 0.250 nm, estimate the potential 
difference through which an electron must be accelerated from rest if it is to be diffracted 
by these planes. 


Solution: 


24.11 V 
Exercise: 


Problem: 


X-rays form ionizing radiation that is dangerous to living tissue and undetectable to the 
human eye. Suppose that a student researcher working in an X-ray diffraction laboratory is 
accidentally exposed to a fatal dose of radiation. Calculate the temperature increase of the 
researcher under the following conditions: the energy of X-ray photons is 200 keV and the 
researcher absorbs 4 x 10° photons per each kilogram of body weight during the 
exposure. Assume that the specific heat of the student’s body is 0.83kcal /kg - K. 


Exercise: 


Problem: 


Solar wind (radiation) that is incident on the top of Earth’s atmosphere has an average 
intensity of 1.3kW /m7?. Suppose that you are building a solar sail that is to propel a 
small toy spaceship with a mass of 0.1 kg in the space between the International Space 
Station and the moon. The sail is made from a very light material, which perfectly reflects 
the incident radiation. To assess whether such a project is feasible, answer the following 
questions, assuming that radiation photons are incident only in normal direction to the sail 
reflecting surface. (a) What is the radiation pressure (force per m7”) of the radiation falling 
on the mirror-like sail? (b) Given the radiation pressure computed in (a), what will be the 
acceleration of the spaceship when the sail has of an area of 10.0m? ? (c) Given the 
acceleration estimate in (b), how fast will the spaceship be moving after 24 hours when it 
starts from rest? 


Solution: 


a. P = 21 /c=8.67 x 10° °N/m?;b.a = PA/m = 8.67 x 10~-4m/s?; c. 74.91 
m/s 

Exercise: 
Problem: 


Treat the human body as a blackbody and determine the percentage increase in the total 
power of its radiation when its temperature increases from 98.6 ° F to 103 ° F. 


Exercise: 


Problem: 


Show that Wien’s displacement law results from Planck’s radiation law. (Hint: substitute 
x = he/ kT and write Planck’s law in the form I(z,T) = Ax? /(e* — 1), where 


A = 2n(kT)” / (h4c3). Now, for fixed T, find the position of the maximum in I(x,T) by 
solving for x in the equation dI(x, T) /dx = 0.) 


Solution: 


xz = 4.965 
Exercise: 


Problem: 


Show that Stefan’s law results from Planck’s radiation law. Hint: To compute the total 
power of blackbody radiation emitted across the entire spectrum of wavelengths at a given 
Co 


temperature, integrate Planck’s law over the entire spectrum P(T’) = | I(A,T)dd. 
0 


Use the substitution 2 = he / AkT and the tabulated value of the integral 


[p aee® j(e* =1) = 4/16 


Additional Problems 


Exercise: 


Problem: 


Determine the power intensity of radiation per unit wavelength emitted at a wavelength of 
500.0 nm by a blackbody at a temperature of 10,000 K. 


Solution: 


7.124 x 10!°W/m? 
Exercise: 
Problem: 
The HCl molecule oscillates at a frequency of 87.0 THz. What is the difference (in eV) 
between its adjacent energy levels? 


Exercise: 


Problem: 


A quantum mechanical oscillator vibrates at a frequency of 250.0 THz. What is the 
minimum energy of radiation it can emit? 


Solution: 


1.034 eV 

Exercise: 
Problem: 
In about 5 billion years, the sun will evolve to a red giant. Assume that its surface 
temperature will decrease to about half its present value of 6000 K, while its present 
radius of 7.0 x 10°m will increase to 1.5 x 10''m (which is the current Earth-sun 


distance). Calculate the ratio of the total power emitted by the sun in its red giant stage to 
its present power. 


Exercise: 
Problem: 


A sodium lamp emits 2.0 W of radiant energy, most of which has a wavelength of about 
589 nm. Estimate the number of photons emitted per second by the lamp. 


Solution: 


5:93 x 10 
Exercise: 
Problem: 
Photoelectrons are ejected from a photoelectrode and are detected at a distance of 2.50 cm 
away from the photoelectrode. The work function of the photoelectrode is 2.71 eV and the 


incident radiation has a wavelength of 420 nm. How long does it take a photoelectron to 
travel to the detector? 


Exercise: 


Problem: 


If the work function of a metal is 3.2 eV, what is the maximum wavelength that a photon 
can have to eject a photoelectron from this metal surface? 


Solution: 


387.8 nm 


Exercise: 


Problem: 
The work function of a photoelectric surface is 2.00 eV. What is the maximum speed of 
the photoelectrons emitted from this surface when a 450-nm light falls on it? 
Exercise: 
Problem: 
A 400-nm laser beam is projected onto a calcium electrode. The power of the laser beam 


is 2.00 mW and the work function of calcium is 2.31 eV. (a) How many photoelectrons per 
second are ejected? (b) What net power is carried away by photoelectrons? 


Solution: 


a. 4.02 x 10/°; b. 0.533 mW 
Exercise: 


Problem: 


(a) Calculate the number of photoelectrons per second that are ejected from a 1.00-mm7 
area of sodium metal by a 500-nm radiation with intensity 1.30kW/ m? (the intensity of 
sunlight above Earth’s atmosphere). (b) Given the work function of the metal as 2.28 eV, 
what power is carried away by these photoelectrons? 


Exercise: 


Problem: 


A laser with a power output of 2.00 mW at a 400-nm wavelength is used to project a beam 
of light onto a calcium photoelectrode. (a) How many photoelectrons leave the calcium 
surface per second? (b) What power is carried away by ejected photoelectrons, given that 
the work function of calcium is 2.31 eV? (c) Calculate the photocurrent. (d) If the 
photoelectrode suddenly becomes electrically insulated and the setup of two electrodes in 
the circuit suddenly starts to act like a 2.00-pF capacitor, how long will current flow 
before the capacitor voltage stops it? 


Solution: 


a. 4.02 x 10°; b. 0.533 mW; c. 0.644 mA; d. 2.57 ns 
Exercise: 


Problem: 


The work function for barium is 2.48 eV. Find the maximum kinetic energy of the ejected 
photoelectrons when the barium surface is illuminated with: (a) radiation emitted by a 
100-kW radio station broadcasting at 800 kHz; (b) a 633-nm laser light emitted from a 
powerful He-Ne laser; and (c) a 434-nm blue light emitted by a small hydrogen gas 
discharge tube. 


Exercise: 
Problem: 
(a) Calculate the wavelength of a photon that has the same momentum as a proton moving 


with 1% of the speed of light in a vacuum. (b) What is the energy of this photon in MeV? 
(c) What is the kinetic energy of the proton in MeV? 


Solution: 


a. 0.132 pm; b. 9.39 MeV; c. 0.047 MeV 

Exercise: 
Problem: 
(a) Find the momentum of a 100-keV X-ray photon. (b) Find the velocity of a neutron 
with the same momentum. (c) What is the neutron’s kinetic energy in eV? 

Exercise: 
Problem: 
The momentum of light, as it is for particles, is exactly reversed when a photon is 
reflected straight back from a mirror, assuming negligible recoil of the mirror. The change 
in momentum is twice the photon’s incident momentum, as it is for the particles. Suppose 
that a beam of light has an intensity 1.0kW /m? and falls on a —2.0-m? area of a mirror 
and reflects from it. (a) Calculate the energy reflected in 1.00 s. (b) What is the 


momentum imparted to the mirror? (c) Use Newton’s second law to find the force on the 
mirror. (d) Does the assumption of no-recoil for the mirror seem reasonable? 


Solution: 


a. 2kJ;b. 1.33 x 10-°kg- m/s; c. 1.33 x 107° N;d. yes 
Exercise: 
Problem: 
A photon of energy 5.0 keV collides with a stationary electron and is scattered at an angle 
of 60°. What is the energy acquired by the electron in the collision? 
Exercise: 
Problem: 
A 0.75-nm photon is scattered by a stationary electron. The speed of the electron’s recoil 


is 1.5 x 10°m /s. (a) Find the wavelength shift of the photon. (b) Find the scattering 
angle of the photon. 


Solution: 


a. 0.003 nm; b. 105.56° 
Exercise: 
Problem: 
Find the maximum change in X-ray wavelength that can occur due to Compton scattering. 
Does this change depend on the wavelength of the incident beam? 
Exercise: 
Problem: 
A photon of wavelength 700 nm is incident on a hydrogen atom. When this photon is 


absorbed, the atom becomes ionized. What is the lowest possible orbit that the electron 
could have occupied before being ionized? 


Solution: 


n=3 
Exercise: 

Problem: 

What is the maximum kinetic energy of an electron such that a collision between the 

electron and a stationary hydrogen atom in its ground state is definitely elastic? 
Exercise: 

Problem: 

Singly ionized atomic helium He*™? is a hydrogen-like ion. (a) What is its ground-state 


radius? (b) Calculate the energies of its four lowest energy states. (c) Repeat the 
calculations for the Li?* ion. 


Solution: 


a. ag / 2; b. —54.4eV/n?; c. ag /3, —122.4eV/n? 
Exercise: 
Problem: 
A triply ionized atom of beryllium Be** is a hydrogen-like ion. When Be** is in one of 


its excited states, its radius in this nth state is exactly the same as the radius of the first 
Bohr orbit of hydrogen. Find n and compute the ionization energy for this state of Be?". 


Exercise: 


Problem: 


In extreme-temperature environments, such as those existing in a solar corona, atoms may 
be ionized by undergoing collisions with other atoms. One example of such ionization in 
the solar corona is the presence of C°* ions, detected in the Fraunhofer spectrum. (a) By 
what factor do the energies of the C** ion scale compare to the energy spectrum of a 
hydrogen atom? (b) What is the wavelength of the first line in the Paschen series of C°* ? 
(c) In what part of the spectrum are these lines located? 


Solution: 


a. 36; b. 18.2 nm; c. UV 
Exercise: 
Problem: 
(a) Calculate the ionization energy for He*. (b) What is the minimum frequency of a 
photon capable of ionizing He* ? 
Exercise: 
Problem: 


Experiments are performed with ultracold neutrons having velocities as small as 1.00 m/s. 
Find the wavelength of such an ultracold neutron and its kinetic energy. 


Solution: 
396 nm; 5.23 neV 


Exercise: 


Problem: 


Find the velocity and kinetic energy of a 6.0-fm neutron. (Rest mass energy of neutron is 
Eo = 940 MeV.) 

Exercise: 
Problem: 
The spacing between crystalline planes in the NaCl crystal is 0.281 nm, as determined by 
X-ray diffraction with X-rays of wavelength 0.170 nm. What is the energy of neutrons in 


the neutron beam that produces diffraction peaks at the same locations as the peaks 
obtained with the X-rays? 


Solution: 


7.3 keV 


Exercise: 


Problem: 
What is the wavelength of an electron accelerated from rest in a 30.0-kV potential 
difference? 
Exercise: 
Problem: 


Calculate the velocity of a 1.0-j1m electron and a potential difference used to accelerate it 
from rest to this velocity. 


Solution: 


728 m/s; 1.5uV 
Exercise: 
Problem: 
In a supercollider at CERN, protons are accelerated to velocities of 0.25c. What are their 
wavelengths at this speed? What are their kinetic energies? If a beam of protons were to 


gain its kinetic energy in only one pass through a potential difference, how high would 
this potential difference have to be? (Rest mass energy of a proton is Hy = 938 MeV). 


Exercise: 
Problem: 


Find the de Broglie wavelength of an electron accelerated from rest in an X-ray tube in the 
potential difference of 100 keV. (Rest mass energy of an electron is Hy = 511 keV.) 


Solution: 

\ = he/./K(2E) + K) = 3.705 x 10°"? m, K = 100keV 
Exercise: 

Problem: 


The cutoff wavelength for the emission of photoelectrons from a particular surface is 500 
nm. Find the maximum kinetic energy of the ejected photoelectrons when the surface is 
illuminated with light of wavelength 450 nm. 


Exercise: 


Problem: 


Compare the wavelength shift of a photon scattered by a free electron to that of a photon 
scattered at the same angle by a free proton. 


Solution: 
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Exercise: 
Problem: 
The spectrometer used to measure the wavelengths of the scattered X-rays in the Compton 
experiment is accurate to 5.0 x 10°-‘nm. What is the minimum scattering angle for 


which the X-rays interacting with the free electrons can be distinguished from those 
interacting with the atoms? 


Exercise: 
Problem: 
Consider a hydrogen-like ion where an electron is orbiting a nucleus that has charge 


q = +Ze. Derive the formulas for the energy £,, of the electron in nth orbit and the 
orbital radius 7,,. 


Solution: 


(Proof) 
Exercise: 
Problem: 
Assume that a hydrogen atom exists in the n = 2 excited state for 10~°s before decaying 


to the ground state. How many times does the electron orbit the proton nucleus during this 
time? How long does it take Earth to orbit the sun this many times? 


Exercise: 


Problem: 


An atom can be formed when a negative muon is captured by a proton. The muon has the 
same charge as the electron and a mass 207 times that of the electron. Calculate the 
frequency of the photon emitted when this atom makes the transition from n = 2 to the 
nm = 1 state. Assume that the muon is orbiting a stationary proton. 


Solution: 


5.1 x 10!’Hz 


Glossary 


double-slit interference experiment 
Young’s double-slit experiment, which shows the interference of waves 


electron microscopy 
microscopy that uses electron waves to “see” fine details of nano-size objects 


Heisenberg uncertainty principle 
sets the limits on precision in simultaneous measurements of momentum and position of a 
particle 


wave-particle duality 
particles can behave as waves and radiation can behave as particles 


Introduction 
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A D-wave 
qubit 
processor: 
The brain of 
a quantum 
computer 
that encodes 
information 
in quantum 
bits to 
perform 
complex 
calculations. 
(credit: 
modificatio 
n of work 
by D-Wave 
Systems, 
Inc.) 
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Quantum mechanics is a powerful framework for understanding the 
motions and interactions of particles at small scales, such as atoms and 
molecules. The ideas behind quantum mechanics often appear quite strange. 
In many ways, our everyday experience with the macroscopic physical 
world does not prepare us for the microscopic world of quantum mechanics. 
The purpose of this chapter is to introduce you to this exciting world. 


Pictured above is a quantum-computer processor. This device is the “brain” 
of a quantum computer that operates at near-absolute zero temperatures. 
Unlike a digital computer, which encodes information in binary digits 
(definite states of either zero or one), a quantum computer encodes 
information in quantum bits or qubits (mixed states of zero and one). 
Quantum computers are discussed in the first section of this chapter. 


Wave Functions 
By the end of this section, you will be able to: 


e Describe the statistical interpretation of the wave function 
e Use the wave function to determine probabilities 
e Calculate expectation values of position, momentum, and kinetic energy 


In the preceding chapter, we saw that particles act in some cases like particles and in other cases like waves. But 
what does it mean for a particle to “act like a wave”? What precisely is “waving”? What rules govern how this 
wave changes and propagates? How is the wave function used to make predictions? For example, if the amplitude 
of an electron wave is given by a function of position and time, Y(z, t), defined for all x, where exactly is the 
electron? The purpose of this chapter is to answer these questions. 


Using the Wave Function 


A clue to the physical meaning of the wave function W(z, t) is provided by the two-slit interference of 
monochromatic light ([link]). (See also Electromagnetic Waves and Interference.) The wave function of a light 
wave is given by E(x,t), and its energy density is given by |E|?, where E is the electric field strength. The energy 
of an individual photon depends only on the frequency of light, €photon = hf, so |E|? is proportional to the 
number of photons. When light waves from S interfere with light waves from Sp» at the viewing screen (a distance 
D away), an interference pattern is produced (part (a) of the figure). Bright fringes correspond to points of 
constructive interference of the light waves, and dark fringes correspond to points of destructive interference of the 
light waves (part (b)). 


Suppose the screen is initially unexposed to light. If the screen is exposed to very weak light, the interference 
pattern appears gradually ({link](c), left to right). Individual photon hits on the screen appear as dots. The dot 
density is expected to be large at locations where the interference pattern will be, ultimately, the most intense. In 
other words, the probability (per unit area) that a single photon will strike a particular spot on the screen is 
proportional to the square of the total electric field, ||? at that point. Under the right conditions, the same 
interference pattern develops for matter particles, such as electrons. 
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Two-slit interference of monochromatic light. (a) Schematic of two-slit interference; (b) light interference 
pattern; (c) interference pattern built up gradually under low-intensity light (left to right). 


Note: 
Visit this interactive simulation to learn more about quantum wave interference. 


The square of the matter wave ||? in one dimension has a similar interpretation as the square of the electric field 
|E|*. It gives the probability that a particle will be found at a particular position and time per unit length, also 
called the probability density. The probability (P) a particle is found in a narrow interval (x, x + dx) at time t is 
therefore 

Equation: 


P(z,« + dx) = |Y(a, t)|?dz. 


(Later, we define the magnitude squared for the general case of a function with “imaginary parts.”) This 
probabilistic interpretation of the wave function is called the Born interpretation. Examples of wave functions 
and their squares for a particular time t are given in [link]. 
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Several examples of wave functions and the 
corresponding square of their wave functions. 


If the wave function varies slowly over the interval Az, the probability a particle is found in the interval is 
approximately 
Equation: 


P(a,z + Az) © |G(z, t)|?Az. 


Notice that squaring the wave function ensures that the probability is positive. (This is analogous to squaring the 
electric field strength—which may be positive or negative—to obtain a positive value of intensity.) However, if the 
wave function does not vary slowly, we must integrate: 

Equation: 


z+Aaz 


P(z,2 + Az) = U(a, t)|?dx. 


This probability is just the area under the function |W(z, t)|? between x and x + Az. The probability of finding 
the particle “somewhere” (the normalization condition) is 


Note: 


Equation: 


For a particle in two dimensions, the integration is over an area and requires a double integral; for a particle in 
three dimensions, the integration is over a volume and requires a triple integral. For now, we stick to the simple 
one-dimensional case. 


Example: 

Where Is the Ball? (Part I) 

A ball is constrained to move along a line inside a tube of length L. The ball is equally likely to be found 
anywhere in the tube at some time t. What is the probability of finding the ball in the left half of the tube at that 
time? (The answer is 50%, of course, but how do we get this answer by using the probabilistic interpretation of 
the quantum mechanical wave function?) 

Strategy 

The first step is to write down the wave function. The ball is equally like to be found anywhere in the box, so one 
way to describe the ball with a constant wave function ([link]). The normalization condition can be used to find 


the value of the function and a simple integration over half of the box yields the final answer. 
W (x, 0) 


Wave function for a ball in a tube of length L. 


Solution 
The wave function of the ball can be written as U(x, t) = C(0 < a < L), where Cis a constant, and 


W(x, t) = 0 otherwise. We can determine the constant C by applying the normalization condition (we set t = 0 to 
simplify the notation): 
Equation: 


P(x = —00, +00) = ii C/?dx = 1. 


This integral can be broken into three parts: (1) negative infinity to zero, (2) zero to L, and (3) L to infinity. The 
particle is constrained to be in the tube, so C' = 0 outside the tube and the first and last integrations are zero. The 
above equation can therefore be written 


Equation: 
i 


peon- [ Cds — 1 


0 


The value C does not depend on x and can be taken out of the integral, so we obtain 
Equation: 


Integration gives 
Equation: 


C= oe 


To determine the probability of finding the ball in the first half of the box (0 < x < L), we have 
Equation: 


L/2 
P(« =0,L/2) / a 
16 = — = 
, L 
0 
Significance 


The probability of finding the ball in the first half of the tube is 50%, as expected. Two observations are 
noteworthy. First, this result corresponds to the area under the constant function from z = 0 to L/2 (the area of a 
square left of L/2). Second, this calculation requires an integration of the square of the wave function. A common 
mistake in performing such calculations is to forget to square the wave function before integration. 
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Example: 

Where Is the Ball? (Part IT) 

A ball is again constrained to move along a line inside a tube of length L. This time, the ball is found 
preferentially in the middle of the tube. One way to represent its wave function is with a simple cosine function 


({link]). What is the probability of finding the ball in the last one-quarter of the tube? 
V(x, 0) 


Wave function for a ball in a tube of length L, 
where the ball is preferentially in the middle of 
the tube. 


Strategy 

We use the same strategy as before. In this case, the wave function has two unknown constants: One is associated 
with the wavelength of the wave and the other is the amplitude of the wave. We determine the amplitude by using 
the boundary conditions of the problem, and we evaluate the wavelength by using the normalization condition. 
Integration of the square of the wave function over the last quarter of the tube yields the final answer. The 
calculation is simplified by centering our coordinate system on the peak of the wave function. 

Solution 

The wave function of the ball can be written 

Equation: 


W(x, 0) = Acos(kx)(—L/2 < « < L/2), 


where A is the amplitude of the wave function and k = 2r/X is its wave number. Beyond this interval, the 
amplitude of the wave function is zero because the ball is confined to the tube. Requiring the wave function to 
terminate at the right end of the tube gives 
L 
wv (« =—, 0) = (0); 
2 


Equation: 
Evaluating the wave function at z = L/2 gives 
Equation: 


A cos (kL /2) = 0. 


This equation is satisfied if the argument of the cosine is an integral multiple of 7/2, 37/2, 57/2, and so on. In 
this case, we have 


Equation: 
ele 
a Tis 
or 
Equation: 
T 
i= =. 
L 


Applying the normalization condition gives A = af 2/L, so the wave function of the ball is 
Equation: 


W(a,0) = = cos (na/L), -—L/2<a< L/2. 


To determine the probability of finding the ball in the last quarter of the tube, we square the function and 
integrate: 
Equation: 


L/2 
P(« = L/4,L/2) = he 


Significance 

The probability of finding the ball in the last quarter of the tube is 9.1%. The ball has a definite wavelength 
(A = 2L). If the tube is of macroscopic length (ZL = 1m), the momentum of the ball is 

Equation: 
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This momentum is much too small to be measured by any human instrument. 


An Interpretation of the Wave Function 


We are now in position to begin to answer the questions posed at the beginning of this section. First, for a traveling 
particle described by Y (x,t) = Asin(ka — wt), what is “waving?” Based on the above discussion, the answer is 
a mathematical function that can, among other things, be used to determine where the particle is likely to be when 
a position measurement is performed. Second, how is the wave function used to make predictions? [f it is 
necessary to find the probability that a particle will be found in a certain interval, square the wave function and 
integrate over the interval of interest. Soon, you will learn soon that the wave function can be used to make many 
other kinds of predictions, as well. 


Third, if a matter wave is given by the wave function V(x, t), where exactly is the particle? Two answers exist: (1) 
when the observer is not looking (or the particle is not being otherwise detected), the particle is everywhere 

(a = —oo, +00); and (2) when the observer is looking (the particle is being detected), the particle “jumps into” a 
particular position state (x, z + dz) with a probability given by P(x, x + dx) = |W(z, t)|?dx —a process called 
state reduction or wave function collapse. This answer is called the Copenhagen interpretation of the wave 
function, or of quantum mechanics. 


To illustrate this interpretation, consider the simple case of a particle that can occupy a small container either at x1 
Or £2 ([link]). In classical physics, we assume the particle is located either at x; or 22 when the observer is not 
looking. However, in quantum mechanics, the particle may exist in a state of indefinite position—that is, it may be 
located at x; and x when the observer is not looking. The assumption that a particle can only have one value of 
position (when the observer is not looking) is abandoned. Similar comments can be made of other measurable 
quantities, such as momentum and energy. 


A two-state system of position of a particle. 


The bizarre consequences of the Copenhagen interpretation of quantum mechanics are illustrated by a creative 
thought experiment first articulated by Erwin Schrédinger (National Geographic, 2013) ({link]): 


“A cat is placed in a steel box along with a Geiger counter, a vial of poison, a hammer, and a radioactive substance. 
When the radioactive substance decays, the Geiger detects it and triggers the hammer to release the poison, which 
subsequently kills the cat. The radioactive decay is a random [probabilistic] process, and there is no way to predict 


when it will happen. Physicists say the atom exists in a state known as a superposition—both decayed and not 
decayed at the same time. Until the box is opened, an observer doesn’t know whether the cat is alive or dead— 
because the cat’s fate is intrinsically tied to whether or not the atom has decayed and the cat would [according to 
the Copenhagen interpretation] be “living and dead ... in equal parts” until it is observed.” 


Schrédinger’s cat. 


Schrédinger took the absurd implications of this thought experiment (a cat simultaneously dead and alive) as an 
argument against the Copenhagen interpretation. However, this interpretation remains the most commonly taught 
view of quantum mechanics. 


Two-state systems (left and right, atom decays and does not decay, and so on) are often used to illustrate the 
principles of quantum mechanics. These systems find many applications in nature, including electron spin and 
mixed states of particles, atoms, and even molecules. Two-state systems are also finding application in the 
quantum computer, as mentioned in the introduction of this chapter. Unlike a digital computer, which encodes 
information in binary digits (zeroes and ones), a quantum computer stores and manipulates data in the form of 
quantum bits, or qubits. In general, a qubit is not in a state of zero or one, but rather in a mixed state of zero and 
one. If a large number of qubits are placed in the same quantum state, the measurement of an individual qubit 
would produce a zero with a probability p, and a one with a probability q = 1 — p. Many scientists believe that 
quantum computers are the future of the computer industry. 


Complex Conjugates 


Later in this section, you will see how to use the wave function to describe particles that are “free” or bound by 
forces to other particles. The specific form of the wave function depends on the details of the physical system. A 
peculiarity of quantum theory is that these functions are usually complex functions. A complex function is one 
that contains one or more imaginary numbers (i = /—1). Experimental measurements produce real 
(nonimaginary) numbers only, so the above procedure to use the wave function must be slightly modified. In 
general, the probability that a particle is found in the narrow interval (x, x + dx) at time t is given by 


Note: 
Equation: 


Pace) = |W (a, t)|?de = W" (a, t)¥ (a, t)dx, 


where (a, t) is the complex conjugate of the wave function. The complex conjugate of a function is obtaining 
by replacing every occurrence of 7 = /—1 in that function with —72. This procedure eliminates complex numbers 
in all predictions because the product W* (x, t) (a, t) is always a real number. 


Note: 
Exercise: 


Problem: Check Your Understanding If a = 3 + 42, what is the product aa? 


Solution: 


(3 + 47) (3 — 41) = 9 — 16:7 = 25 


Consider the motion of a free particle that moves along the x-direction. As the name suggests, a free particle 
experiences no forces and so moves with a constant velocity. As we will see in a later section of this chapter, a 
formal quantum mechanical treatment of a free particle indicates that its wave function has real and complex parts. 
In particular, the wave function is given by 

Equation: 


W (x,t) = Acos (kx — wt) + iAsin (ka — ut), 
where A is the amplitude, k is the wave number, and w is the angular frequency. Using Euler’s formula, 


e’? = cos() + isin (¢), this equation can be written in the form 
Equation: 


W (x,t) = Aeil*r—“t) — Ae’, 
where ¢ is the phase angle. If the wave function varies slowly over the interval Az, the probability of finding the 
particle in that interval is 
Equation: 
P(a,x + Az) = W" (a, t)¥ (2, t)Ax = (Ae’*) (4°e*) Aa = (A'A)Az. 
If Ahas real and complex parts (a + ib, where a and b are real constants), then 
Equation: 
A’ A = (a+ ib) (a — ib) = a? +8’. 


Notice that the complex numbers have vanished. Thus, 
Equation: 


P(a,z+ Az) = |A|?Az 


is areal quantity. The interpretation of UV" (a, t)V (a, t) as a probability density ensures that the predictions of 
quantum mechanics can be checked in the “real world.” 


Note: 
Exercise: 


Problem: 


Check Your Understanding Suppose that a particle with energy E is moving along the x-axis and is 
confined in the region between 0 and L. One possible wave function is 
Equation: 


—iEt/h s, AL < < 
HCN Ae sin +, wien 0 ga< L 
0, otherwise 


Determine the normalization constant. 


Solution: 


A= 9k 


Expectation Values 


In classical mechanics, the solution to an equation of motion is a function of a measurable quantity, such as x(t), 
where x is the position and t is the time. Note that the particle has one value of position for any time t. In quantum 
mechanics, however, the solution to an equation of motion is a wave function, Y (a, t). The particle has many 
values of position for any time t, and only the probability density of finding the particle, |W(z, t)|?, can be known. 
The average value of position for a large number of particles with the same wave function is expected to be 
Equation: 


This is called the expectation value of the position. It is usually written 


Note: 
Equation: 


co 


(C)— J (x, t)xW (a, t)dz, 


—oo 


where the x is sandwiched between the wave functions. The reason for this will become apparent soon. Formally, x 
is called the position operator. 


At this point, it is important to stress that a wave function can be written in terms of other quantities as well, such 
as velocity (v), momentum (p), and kinetic energy (K). The expectation value of momentum, for example, can be 
written 

Equation: 


8 


(p) = | © (p,t)p¥ (p, t)dp, 


Where dp is used instead of dx to indicate an infinitesimal interval in momentum. In some cases, we know the 
wave function in position, Y (x, t), but seek the expectation of momentum. The procedure for doing this is 
Equation: 


co 


= |e ea) (-in)w tide. 


where the quantity in parentheses, sandwiched between the wave functions, is called the momentum operator in 
the x-direction. [The momentum operator in [link] is said to be the position-space representation of the momentum 
operator.] The momentum operator must act (operate) on the wave function to the right, and then the result must be 
multiplied by the complex conjugate of the wave function on the left, before integration. The momentum operator 
in the x-direction is sometimes denoted 

Equation: 


4 a 
(Dz) — Hh 


Momentum operators for the y- and z-directions are defined similarly. This operator and many others are derived in 
a more advanced course in modern physics. In some cases, this derivation is relatively simple. For example, the 
kinetic energy operator is just 


Equation: 
: 2 
(K) _ 1 ( : _ Ge _ (-i#-+) _ a d d 
op 9 Vt)op ~ Om 2m ~ Im \ dx dx }- 


Thus, if we seek an expectation value of kinetic energy of a particle in one dimension, two successive ordinary 
derivatives of the wave function are required before integration. 


Expectation-value calculations are often simplified by exploiting the symmetry of wave functions. Symmetric 
wave functions can be even or odd. An even function is a function that satisfies 
Equation: 


In contrast, an odd function is a function that satisfies 
Equation: 


(2) 


I 

| 
= 
2 


An example of even and odd functions is shown in [link]. An even function is symmetric about the y-axis. This 
function is produced by reflecting ~ (x) for x > 0 about the vertical y-axis. By comparison, an odd function is 
generated by reflecting the function about the y-axis and then about the x-axis. (An odd function is also referred to 
as an anti-symmetric function.) 


—0.57 


Examples of even and odd wave 
functions. 


In general, an even function times an even function produces an even function. A simple example of an even 
function is the product ae-® (even times even is even). Similarly, an odd function times an odd function 
produces an even function, such as x sin x (odd times odd is even). However, an odd function times an even 
function produces an odd function, such as te* (odd times even is odd). The integral over all space of an odd 
function is zero, because the total area of the function above the x-axis cancels the (negative) area below it. As the 
next example shows, this property of odd functions is very useful. 


Example: 

Expectation Value (Part I) 

The normalized wave function of a particle is 
Equation: 


Uo) = el» //a5. 


Find the expectation value of position. 

Strategy 

Substitute the wave function into [link] and evaluate. The position operator introduces a multiplicative factor only, 
so the position operator need not be “sandwiched.” 


Solution 
First multiply, then integrate: 
Equation: 

+00 +00 % +00 

- e |#l/2o 1 . 
(z) = [ axeiv(o) = [axe sh [ eve lel/eo — Q, 
/x0 oa) 

69 = 00) 

Significance 


The function in the integrand (xe~?!*!/*») is odd since it is the product of an odd function (x) and an even function 
(e~?/#|/*0), The integral vanishes because the total area of the function about the x-axis cancels the (negative) area 


below it. The result (() = 0) is not surprising since the probability density function is symmetric about 2 = 0. 


Example: 

Expectation Value (Part IT) 

The time-dependent wave function of a particle confined to a region between 0 and L is 
Equation: 


(x,t) = Ae“ sin(nx/L) 


where w is angular frequency and E is the energy of the particle. (Note: The function varies as a sine because of 
the limits (0 to L). When « = 0, the sine factor is zero and the wave function is zero, consistent with the boundary 
conditions.) Calculate the expectation values of position, momentum, and kinetic energy. 

Strategy 

We must first normalize the wave function to find A. Then we use the operators to calculate the expectation 
values. 


Solution 
Computation of the normalization constant: 
Equation: 

L L i Z . 
ih [ ev" @ue@) = fe (Act sin ™) (4c sin ™) = a& f ae sin? = A? AG / 

L L L 2 
0 0 0 
The expectation value of position is 
Equation: 
ib i L rR e 

(x) = [av @ev@) = je (emt sin ™)z Co sin ~) AS [axe sin? = ee eer 

0 0 0 


The expectation value of momentum in the x-direction also requires an integral. To set this integral up, the 
associated operator must— by rule—act to the right on the wave function w(z): 
Equation: 


d d ha 
—ih—wv(z) = —ih— Ae sin — = —i—e™* cos — . 
dz dx 


Therefore, the expectation value of momentum is 
Equation: 


e i 
2 
= re a) wi gAl eos me \ _ Ath _ me 
(p) fe (4e sin —— ( tape COS > tr dz sin ane 0. 
0 0 


The function in the integral is a sine function with a wavelength equal to the width of the well, L—an odd 
function about « = L/2. As a result, the integral vanishes. 

The expectation value of kinetic energy in the x-direction requires the associated operator to act on the wave 
function: 


Equation: 
i a i da? Tx ih? Bos Ta Ah? = 1x 
Pape ee tem cciate teal A ee ee a = So —iwt : = —iwt _: pane 
DeLee arpa ches We  eahe sae ee 


Thus, the expectation value of the kinetic energy is 


Equation: 


L 

_ _Azh? 9 Ml A Db Be 

= [ axsin TL. smi? 2 ~ &mi?* 
0 


Significance 

The average position of a large number of particles in this state is L/2. The average momentum of these particles 
is zero because a given particle is equally likely to be moving right or left. However, the particle is not at rest 
because its average kinetic energy is not zero. Finally, the probability density is 

Equation: 


|p|? = (2/L)sin?(ma/L). 


This probability density is largest at location L/2 and is zero at x = 0 and at x = L. Note that these conclusions 
do not depend explicitly on time. 


Note: 
Exercise: 


Problem: 


Check Your Understanding For the particle in the above example, find the probability of locating it 
between positions 0 and L/4 


Solution: 


(1/2 —1/n)/2=9% 


Quantum mechanics makes many surprising predictions. However, in 1920, Niels Bohr (founder of the Niels Bohr 
Institute in Copenhagen, from which we get the term “Copenhagen interpretation”) asserted that the predictions of 
quantum mechanics and classical mechanics must agree for all macroscopic systems, such as orbiting planets, 
bouncing balls, rocking chairs, and springs. This correspondence principle is now generally accepted. It suggests 
the rules of classical mechanics are an approximation of the rules of quantum mechanics for systems with very 
large energies. Quantum mechanics describes both the microscopic and macroscopic world, but classical 
mechanics describes only the latter. 


Summary 


e In quantum mechanics, the state of a physical system is represented by a wave function. 

e In Born’s interpretation, the square of the particle’s wave function represents the probability density of 
finding the particle around a specific location in space. 

e Wave functions must first be normalized before using them to make predictions. 

e The expectation value is the average value of a quantity that requires a wave function and an integration. 


Conceptual Questions 


Exercise: 


Problem: 


What is the physical unit of a wave function, Y(x, t)? What is the physical unit of the square of this wave 
function? 


Solution: 
1/vL, where L = length; 1/L, where L = length 


Exercise: 


Problem: Can the magnitude of a wave function (W" (x, t) U(a, t)) be a negative number? Explain. 
Exercise: 

Problem: What kind of physical quantity does a wave function of an electron represent? 

Solution: 


The wave function does not correspond directly to any measured quantity. It is a tool for predicting the values 
of physical quantities. 


Exercise: 


Problem: What is the physical meaning of a wave function of a particle? 


Exercise: 


Problem: What is the meaning of the expression “expectation value?” Explain. 
Solution: 


The average value of the physical quantity for a large number of particles with the same wave function. 


Problems 


Exercise: 


Problem: Compute |¥(z, t)| 2 for the function U(x,t) = p(x) sin wt, where w is a real constant. 
Solution: 
v(x) | sin wt 

Exercise: 


Problem: Given the complex-valued function f(z, y) = (x — iy)/(x + ty), calculate | f(z, y)|?. 


Exercise: 


Problem: 


Which one of the following functions, and why, qualifies to be a wave function of a particle that can move 
along the entire real axis? (a) (x) = Ae” ; 

(b) w(x) = Ae~*; (C Y(a) = Atan az; 

(d) (x) = A(sin x) /x; (e) (a) = Ae". 


Solution: 


(a) and (e), can be normalized 
Exercise: 
Problem: 
A particle with mass m moving along the x-axis and its quantum state is represented by the following wave 
function: 
0, «<0, 
Axe—% et Bt/h, z>0, 


U(zx,t) = { 


where a = 2.0 x 10?°m~1. (a) Find the normalization constant. (b) Find the probability that the particle can 
be found on the interval 0 < x < ZL. (c) Find the expectation value of position. (d) Find the expectation value 
of kinetic energy. 


Exercise: 


Problem: A wave function of a particle with mass m is given by 


2a? 


Y(z) = 


a =a SES Se 


0, otherwise, 


where a = 1.00 x 102? /m. (a) Find the normalization constant. (b) Find the probability that the particle can 
be found on the interval 0 < « < 0.5 x 107!m. (© Find the particle’s average position. (d) Find its 
average momentum. (e) Find its average kinetic energy —0.5 x 10°1°m < x < +0.5 x 107!°m. 


Solution: 


a. A = \/2a/m; b. probability = 29.3%; c. (x) = 0; d. (p) = 0; e. (K) = a?h?/2m 


Glossary 


anti-symmetric function 
odd function 


Born interpretation 
states that the square of a wave function is the probability density 


complex function 
function containing both real and imaginary parts 


Copenhagen interpretation 
states that when an observer is not looking or when a measurement is not being made, the particle has many 
values of measurable quantities, such as position 


correspondence principle 


in the limit of large energies, the predictions of quantum mechanics agree with the predictions of classical 
mechanics 


expectation value 
average value of the physical quantity assuming a large number of particles with the same wave function 


even function 
in one dimension, a function symmetric with the origin of the coordinate system 


momentum operator 
operator that corresponds to the momentum of a particle 


normalization condition 
requires that the probability density integrated over the entire physical space results in the number one 


odd function 
in one dimension, a function antisymmetric with the origin of the coordinate system 


position operator 
operator that corresponds to the position of a particle 


probability density 
square of the particle’s wave function 


state reduction 
hypothetical process in which an observed or detected particle “jumps into” a definite state, often described in 
terms of the collapse of the particle’s wave function 


wave function 
function that represents the quantum state of a particle (quantum system) 


wave function collapse 
equivalent to state reduction 


The Heisenberg Uncertainty Principle 
By the end of this section, you will be able to: 


¢ Describe the physical meaning of the position-momentum uncertainty relation 
e Explain the origins of the uncertainty principle in quantum theory 
e Describe the physical meaning of the energy-time uncertainty relation 


Heisenberg’s uncertainty principle is a key principle in quantum mechanics. Very 
roughly, it states that if we know everything about where a particle is located (the 
uncertainty of position is small), we know nothing about its momentum (the uncertainty 
of momentum is large), and vice versa. Versions of the uncertainty principle also exist 
for other quantities as well, such as energy and time. We discuss the momentum-position 
and energy-time uncertainty principles separately. 


Momentum and Position 


To illustrate the momentum-position uncertainty principle, consider a free particle that 
moves along the x-direction. The particle moves with a constant velocity u and 
momentum p = mu. According to de Broglie’s relations, p = hk and EF = hw. As 
discussed in the previous section, the wave function for this particle is given by 
Equation: 


,(x,t) = Alcos(wt — kx) —isin(wt — kx)] = Ae~**t-*2) — Ae-tvtethe 


and the probability density |,(a,t)|? = A? is uniform and independent of time. The 
particle is equally likely to be found anywhere along the x-axis but has definite values of 
wavelength and wave number, and therefore momentum. The uncertainty of position is 
infinite (we are completely uncertain about position) and the uncertainty of the 
momentum is zero (we are completely certain about momentum). This account of a free 
particle is consistent with Heisenberg’s uncertainty principle. 


Similar statements can be made of localized particles. In quantum theory, a localized 
particle is modeled by a linear superposition of free-particle (or plane-wave) states called 
a wave packet. An example of a wave packet is shown in [link]. A wave packet contains 
many wavelengths and therefore by de Broglie’s relations many momenta—possible in 
quantum mechanics! This particle also has many values of position, although the particle 
is confined mostly to the interval Aw. The particle can be better localized (Az can be 
decreased) if more plane-wave states of different wavelengths or momenta are added 
together in the right way (Ap is increased). According to Heisenberg, these uncertainties 
obey the following relation. 


Note: 

The Heisenberg Uncertainty Principle 

The product of the uncertainty in position of a particle and the uncertainty in its 
momentum can never be less than one-half of the reduced Planck constant: 
Equation: 


Az Ap > i/2. 


This relation expresses Heisenberg’s uncertainty principle. It places limits on what we 
can know about a particle from simultaneous measurements of position and momentum. 
If Az is large, Ap is small, and vice versa. [link] can be derived in a more advanced 
course in modern physics. Reflecting on this relation in his work The Physical Principles 
of the Quantum Theory, Heisenberg wrote “Any use of the words ‘position’ and 
‘velocity’ with accuracy exceeding that given by [the relation] is just as meaningless as 
the use of words whose sense is not defined.” 


Several plane waves Wave packet 
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Adding together several plane waves of different 
wavelengths can produce a wave that is relatively 
localized. 


Note that the uncertainty principle has nothing to do with the precision of an 
experimental apparatus. Even for perfect measuring devices, these uncertainties would 
remain because they originate in the wave-like nature of matter. The precise value of the 
product Az Ap depends on the specific form of the wave function. Interestingly, the 
Gaussian function (or bell-curve distribution) gives the minimum value of the 
uncertainty product: Az Ap = h/2. 


Example: 

The Uncertainty Principle Large and Small 

Determine the minimum uncertainties in the positions of the following objects if their 
speeds are known with a precision of 1.0 x 10~°m/s: (a) an electron and (b) a 
bowling ball of mass 6.0 kg. 

Strategy 

Given the uncertainty in speed Au = 1.0 x 10~*m/s, we have to first determine the 
uncertainty in momentum Ap = m Au and then invert [link] to find the uncertainty in 
position Ax = h/(2Ap). 

Solution 


a. For the electron: 
Equation: 
Ap = mAu=(9.1 x 107° kg)(1.0 x 10-°m/s) = 9.1 x 10°“ kg- m/s, 


i oe = 5 scm. 


b. For the bowling ball: 


Equation: 
Ap = mAu= (6.0kg)(1.0 x 10°-3m/s) = 6.0 x 10°-*kg- m/s, 
Az = zh =88 x 10°*m. 
Significance 


Unlike the position uncertainty for the electron, the position uncertainty for the bowling 
ball is immeasurably small. Planck’s constant is very small, so the limitations imposed 
by the uncertainty principle are not noticeable in macroscopic systems such as a 
bowling ball. 


Example: 
Uncertainty and the Hydrogen Atom 


Estimate the ground-state energy of a hydrogen atom using Heisenberg’s uncertainty 
principle. (Hint: According to early experiments, the size of a hydrogen atom is 
approximately 0.1 nm.) 

Strategy 

An electron bound to a hydrogen atom can be modeled by a particle bound to a one- 
dimensional box of length L = 0.1 nm. The ground-state wave function of this system 
is a half wave, like that given in [link]. This is the largest wavelength that can “fit” in 
the box, so the wave function corresponds to the lowest energy state. Note that this 
function is very similar in shape to a Gaussian (bell curve) function. We can take the 
average energy of a particle described by this function (E) as a good estimate of the 
ground state energy (Eo). This average energy of a particle is related to its average of 
the momentum squared, which is related to its momentum uncertainty. 

Solution 

To solve this problem, we must be specific about what is meant by “uncertainty of 
position” and “uncertainty of momentum.” We identify the uncertainty of position (Az) 
with the standard deviation of position (o,,), and the uncertainty of momentum (Ap) 
with the standard deviation of momentum (¢,). For the Gaussian function, the 
uncertainty product is 

Equation: 


Oz7Op = 9? 


where 
Equation: 


2 2 2 


a — f. and Gs =i =e 

The particle is equally likely to be moving left as moving right, so p = 0. Also, the 
uncertainty of position is comparable to the size of the box, soo, = L. The estimated 
ground state energy is therefore 


Equation: 
mo i fey? i fie ie 
Ey = Eee = SS = ee) a = soe 
m 2m 2m \ 20, 2m \ 2L 8mL? 
Multiplying numerator and denominator by c? gives 
Equation: 
fic)? 197.3 eV - nm)? 
Eo am = pa Is = 0.952eV = 1leV. 


~ 8(me?)L? 8 (0.511 - 10%eV) (0.1 nm)” 


Significance 


Based on early estimates of the size of a hydrogen atom and the uncertainty principle, 
the ground-state energy of a hydrogen atom is in the eV range. The ionization energy of 
an electron in the ground-state energy is approximately 10 eV, so this prediction is 
roughly confirmed. (Note: The product fc is often a useful value in performing 
calculations in quantum mechanics.) 


Energy and Time 


Another kind of uncertainty principle concerns uncertainties in simultaneous 
measurements of the energy of a quantum state and its lifetime, 


Note: 
Equation: 


AEAt 


|V 
wo | > 


y 


where AF is the uncertainty in the energy measurement and At is the uncertainty in the 
lifetime measurement. The energy-time uncertainty principle does not result from a 
relation of the type expressed by [link] for technical reasons beyond this discussion. 
Nevertheless, the general meaning of the energy-time principle is that a quantum state 
that exists for only a short time cannot have a definite energy. The reason is that the 
frequency of a state is inversely proportional to time and the frequency connects with the 
energy of the state, so to measure the energy with good precision, the state must be 
observed for many cycles. 


To illustrate, consider the excited states of an atom. The finite lifetimes of these states 
can be deduced from the shapes of spectral lines observed in atomic emission spectra. 
Each time an excited state decays, the emitted energy is slightly different and, therefore, 
the emission line is characterized by a distribution of spectral frequencies (or 
wavelengths) of the emitted photons. As a result, all spectral lines are characterized by 
spectral widths. The average energy of the emitted photon corresponds to the theoretical 
energy of the excited state and gives the spectral location of the peak of the emission 
line. Short-lived states have broad spectral widths and long-lived states have narrow 
spectral widths. 


Example: 

Atomic Transitions 

An atom typically exists in an excited state for about At = 10~°’s. Estimate the 
uncertainty A f in the frequency of emitted photons when an atom makes a transition 
from an excited state with the simultaneous emission of a photon with an average 
frequency of f = 7.1 x 10’ Hz. Is the emitted radiation monochromatic? 

Strategy 

We invert [link] to obtain the energy uncertainty AE’ ~ #/2At and combine it with the 
photon energy & = hf to obtain A f. To estimate whether or not the emission is 
monochromatic, we evaluate A f/f. 


Solution 
The spread in photon energies is A F = hA f. Therefore, 
Equation: 
et es oe eee i = 6 
Af _ 80x10°Hz _ —8 
i 0 a = teh MU 
Significance 


Because the emitted photons have their frequencies within 1.1 x 10° percent of the 
average frequency, the emitted radiation can be considered monochromatic. 


Note: 
Exercise: 


Problem: 


Check Your Understanding A sodium atom makes a transition from the first 
excited state to the ground state, emitting a 589.0-nm photon with energy 2.105 eV. 
If the lifetime of this excited state is 1.6 x 107~°s, what is the uncertainty in 
energy of this excited state? What is the width of the corresponding spectral line? 


Solution: 


4.1 x 10-8eV; 1.1 x 10->nm 


Summary 


e The Heisenberg uncertainty principle states that it is impossible to simultaneously 
measure the x-components of position and of momentum of a particle with an 


arbitrarily high precision. The product of experimental uncertainties is always larger 
than or equal to h/2. 

¢ The limitations of this principle have nothing to do with the quality of the 
experimental apparatus but originate in the wave-like nature of matter. 

e The energy-time uncertainty principle expresses the experimental observation that a 
quantum state that exists only for a short time cannot have a definite energy. 


Conceptual Questions 


Exercise: 
Problem: 
If the formalism of quantum mechanics is ‘more exact’ than that of classical 


mechanics, why don’t we use quantum mechanics to describe the motion of a 
leaping frog? Explain. 


Exercise: 
Problem: 


Can the de Broglie wavelength of a particle be known precisely? Can the position of 
a particle be known precisely? 


Solution: 
Yes, if its position is completely unknown. Yes, if its momentum is completely 
unknown. 


Exercise: 


Problem: 


Can we measure the energy of a free localized particle with complete precision? 
Exercise: 
Problem: 


Can we measure both the position and momentum of a particle with complete 
precision? 


Solution: 


No. According to the uncertainty principle, if the uncertainty on the particle’s 
position is small, the uncertainty on its momentum is large. Similarly, if the 
uncertainty on the particle’s position is large, the uncertainty on its momentum is 
small. 


Problems 


Exercise: 
Problem: 
A velocity measurement of an a-particle has been performed with a precision of 
0.02 mm/s. What is the minimum uncertainty in its position? 

Exercise: 
Problem: 
A gas of helium atoms at 273 K is in a cubical container with 25.0 cm on a side. (a) 
What is the minimum uncertainty in momentum components of helium atoms? (b) 


What is the minimum uncertainty in velocity components? (c) Find the ratio of the 
uncertainties in (b) to the mean speed of an atom in each direction. 


Solution: 


a. Ap > 2.11 x 10°-°4N-s;b. Av > 6.31 x 10~$m;c. 
Av///kpT/me = 5.94 x 10-7! 
Exercise: 
Problem: 
If the uncertainty in the y-component of a proton’s position is 2.0 pm, find the 
minimum uncertainty in the simultaneous measurement of the proton’s y- 


component of velocity. What is the minimum uncertainty in the simultaneous 
measurement of the proton’s x-component of velocity? 


Exercise: 


Problem: 


Some unstable elementary particle has a rest energy of 80.41 GeV and an 
uncertainty in rest energy of 2.06 GeV. Estimate the lifetime of this particle. 


Solution: 


Ar>1.6 x 107*°s 


Exercise: 


Problem: 


An atom in a metastable state has a lifetime of 5.2 ms. Find the minimum 
uncertainty in the measurement of energy of the excited state. 


Exercise: 


Problem: 


Measurements indicate that an atom remains in an excited state for an average time 
of 50.0 ns before making a transition to the ground state with the simultaneous 
emission of a 2.1-eV photon. (a) Estimate the uncertainty in the frequency of the 
photon. (b) What fraction of the photon’s average frequency is this? 


Solution: 


a. Af > 1.59 MHz; b. Aw/wo = 3.135 x 107° 
Exercise: 


Problem: 


Suppose an electron is confined to a region of length 0.1 nm (of the order of the size 
of a hydrogen atom). (a) What is the minimum uncertainty of its momentum? (b) 
What would the uncertainty in momentum be if the confined length region doubled 
to 0.2 nm? 


Glossary 


Heisenberg’s uncertainty principle 
places limits on what can be known from a simultaneous measurements of position 
and momentum; states that if the uncertainty on position is small then the 
uncertainty on momentum is large, and vice versa 


wave packet 
superposition of many plane matter waves that can be used to represent a localized 
particle 


energy-time uncertainty principle 
energy-time relation for uncertainties in the simultaneous measurements of the 
energy of a quantum state and of its lifetime 


The Schrodinger Equation 
By the end of this section, you will be able to: 


e Describe the role SchrOdinger’s equation plays in quantum mechanics 

e Explain the difference between time-dependent and -independent 
SchrOdinger’s equations 

e Interpret the solutions of SchrOdinger’s equation 


In the preceding two sections, we described how to use a quantum 
mechanical wave function and discussed Heisenberg’s uncertainty principle. 
In this section, we present a complete and formal theory of quantum 
mechanics that can be used to make predictions. In developing this theory, 
it is helpful to review the wave theory of light. For a light wave, the electric 
field E(x,t) obeys the relation 

Equation: 


OE 1 OE 


O22 2 2’ 


where c is the speed of light and the symbol 0 represents a partial 
derivative. (Recall from Oscillations that a partial derivative is closely 
related to an ordinary derivative, but involves functions of more than one 
variable. When taking the partial derivative of a function by a certain 
variable, all other variables are held constant.) A light wave consists of a 
very large number of photons, so the quantity | F(z, t) ? can interpreted as a 
probability density of finding a single photon at a particular point in space 
(for example, on a viewing screen). 


There are many solutions to this equation. One solution of particular 
importance is 
Equation: 


E(a,t) = Asin (ka — wt), 


where A is the amplitude of the electric field, k is the wave number, and w is 
the angular frequency. Combing this equation with [link] gives 


Equation: 


a 
C2 


According to de Broglie’s equations, we have p = hk and E = hw. 
Substituting these equations in [link] gives 


Equation: 
E 
p=, 
Cc 
or 
Equation: 
E = pe. 


Therefore, according to Einstein’s general energy-momentum equation 
({link]), [link] describes a particle with a zero rest mass. This is consistent 
with our knowledge of a photon. 


This process can be reversed. We can begin with the energy-momentum 
equation of a particle and then ask what wave equation corresponds to it. 
The energy-momentum equation of a nonrelativistic particle in one 
dimension is 
Equation: 

2 


E=— +U(z,t), 


2m 


where p is the momentum, m is the mass, and U is the potential energy of 
the particle. The wave equation that goes with it turns out to be a key 
equation in quantum mechanics, called SchrOdinger’s time-dependent 
equation. 


Note: 

The Schrodinger Time-Dependent Equation 

The equation describing the energy and momentum of a wave function is 
known as the SchrOdinger equation: 

Equation: 


As described in Potential Energy and Conservation of Energy, the force on 
the particle described by this equation is given by 
Equation: 


OU (a, t) 
| pega La 
Ox 


This equation plays a role in quantum mechanics similar to Newton’s 
second law in classical mechanics. Once the potential energy of a particle is 
specified—or, equivalently, once the force on the particle is specified—we 
can solve this differential equation for the wave function. The solution to 
Newton’s second law equation (also a differential equation) in one 
dimension is a function x(t) that specifies where an object is at any time t. 
The solution to Schrodinger’s time-dependent equation provides a tool— 
the wave function—that can be used to determine where the particle is 
likely to be. This equation can be also written in two or three dimensions. 
Solving Schrodinger’s time-dependent equation often requires the aid of a 
computer. 


Consider the special case of a free particle. A free particle experiences no 
force (F' = 0). Based on [link], this requires only that 
Equation: 


U (x,t) = Up = constant. 


For simplicity, we set Up = 0. SchrOdinger’s equation then reduces to 
Equation: 


nh? 0?W (z,t) - OW (z, t) 
2n = Ox? : Ot 


A valid solution to this equation is 
Equation: 


W (¢,t) = Aer), 


Not surprisingly, this solution contains an imaginary number (i = V/—1) 
because the differential equation itself contains an imaginary number. As 
stressed before, however, quantum-mechanical predictions depend only on 
| (a, t) * which yields completely real values. Notice that the real plane- 
wave solutions, Y (z,t) = Asin (ka — wt) and 

WY (x,t) = Acos (kx — wt), do not obey Schrédinger’s equation. The 
temptation to think that a wave function can be seen, touched, and felt in 
nature is eliminated by the appearance of an imaginary number. In 
SchrOdinger’s theory of quantum mechanics, the wave function is merely a 
tool for calculating things. 


If the potential energy function (U) does not depend on time, it is possible 
to show that 


Note: 
Equation: 


satisfies SchrOdinger’s time-dependent equation, where ~ (x) is a time- 
independent function and e~* is a space-independent function. In other 
words, the wave function is separable into two parts: a space-only part and 
a time-only part. The factor e *“” is sometimes referred to as a time- 
modulation factor since it modifies the space-only function. According to 
de Broglie, the energy of a matter wave is given by E = hw, where E is its 
total energy. Thus, the above equation can also be written as 

Equation: 


W (x,t) =o(x)e ". 


Any linear combination of such states (mixed state of energy or 
momentum) is also valid solution to this equation. Such states can, for 
example, describe a localized particle (see [link]) 


Note: 
Exercise: 


Problem: 


Check Your Understanding A particle with mass m is moving along 
the x-axis in a potential given by the potential energy function 

U(x) = 0.5mw*z?. Compute the product U(a,t) U (x) (2, t). 
Express your answer in terms of the time-independent wave function, 


v(x). 
Solution: 


0.5muw?xb(x) (z) 


Combining [link] and [link], Schrédinger’s time-dependent equation 
reduces to 


Note: 
Equation: 


where E is the total energy of the particle (a real number). This equation is 
called SchrOdinger’s time-independent equation. Notice that we use “big 
psi” (W) for the time-dependent wave function and “little psi” (q) for the 
time-independent wave function. The wave-function solution to this 
equation must be multiplied by the time-modulation factor to obtain the 
time-dependent wave function. 


In the next sections, we solve SchrOdinger’s time-independent equation for 
three cases: a quantum particle in a box, a simple harmonic oscillator, and a 
quantum barrier. These cases provide important lessons that can be used to 

solve more complicated systems. The time-independent wave function 

w (ax) solutions must satisfy three conditions: 


e (a) must be a continuous function. 

e The first derivative of ~ (x) with respect to space, dy (x) /dax, must be 
continuous, unless V (2) = oo. 

e w(x) must not diverge (“blow up”) at 2 = too. 


The first condition avoids sudden jumps or gaps in the wave function. The 
second condition requires the wave function to be smooth at all points, 
except in special cases. (In a more advanced course on quantum mechanics, 
for example, potential spikes of infinite depth and height are used to model 
solids). The third condition requires the wave function be normalizable. 
This third condition follows from Born’s interpretation of quantum 
mechanics. It ensures that | (a) ki is a finite number so we can use it to 
calculate probabilities. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Which of the following wave functions 
is a valid wave-function solution for Schrodinger’s equation? 


W(x) V(x) V(x) 


= 


x 


Solution: 


None. The first function has a discontinuity; the second function is 
double-valued; and the third function diverges so is not normalizable. 


Summary 


e The Schrodinger equation is the fundamental equation of wave 
quantum mechanics. It allows us to make predictions about wave 
functions. 

e When a particle moves in a time-independent potential, a solution of 
the time-dependent SchrOdinger equation is a product of a time- 
independent wave function and a time-modulation factor. 

e The SchrOdinger equation can be applied to many physical situations. 


Conceptual Questions 


Exercise: 


Problem: 
What is the difference between a wave function 7(2, y, z) and a wave 
function U(x, y, z, t) for the same particle? 
Exercise: 
Problem: 


If a quantum particle is in a stationary state, does it mean that it does 
not move? 


Solution: 
No, it means that predictions about the particle (expressed in terms of 
probabilities) are time-independent. 
Exercise: 
Problem: 
Explain the difference between time-dependent and -independent 
SchrOdinger’s equations. 
Exercise: 
Problem: 
Suppose a wave function is discontinuous at some point. Can this 


function represent a quantum state of some physical particle? Why? 
Why not? 


Solution: 


No, because the probability of the particle existing in a narrow 
(infinitesimally small) interval at the discontinuity is undefined. 


Problems 


Exercise: 


Problem: Combine [link] and [link] to show k* = _ 


Solution: 


2 
Carrying out the derivatives yields k? = a 


Exercise: 
Problem: 
Show that W (a, t) = Ae*(**-“*) is a valid solution to Schrédinger’s 
time-dependent equation. 
Exercise: 
Problem: 
Show that Y (2, t) = Asin (ka — wt) and 


WV (x,t) = Acos (kx — wt) do not obey Schrédinger’s time- 
dependent equation. 


Solution: 


Carrying out the derivatives (as above) for the sine function gives a 
cosine on the right side the equation, so the equality fails. The same 
occurs for the cosine solution. 


Exercise: 
Problem: 
Show that when W,(z, t) and W»2(z, t) are solutions to the time- 
dependent Schrodinger equation and A,B are numbers, then a function 


W(x, t) that is a superposition of these functions is also a solution: 
W(2,t) = AV,(z,t) + BW,(z,t). 


Exercise: 


Problem: 


A particle with mass m is described by the following wave function: 
W(x) = Acoskx + Bsin ka, where A, B, and k are constants. 
Assuming that the particle is free, show that this function is the 
solution of the stationary Schrodinger equation for this particle and 
find the energy that the particle has in this state. 


Solution: 


E = hk? /2m 
Exercise: 
Problem: 
Find the expectation value of the kinetic energy for the particle in the 


state, W (x,t) — Ae’(**-“*), What conclusion can you draw from your 
solution? 


Exercise: 
Problem: 
Find the expectation value of the square of the momentum squared for 


the particle in the state, Y (a, t) = Ae**-“*), What conclusion can 
you draw from your solution? 


Solution: 


hk”; The particle has definite momentum and therefore definite 
momentum squared. 

Exercise: 
Problem: 


A free proton has a wave function given by 
W(x, t) = Ae? (5.02 x 10" —8.00 x 10"°t) 
y) = ° 


The coefficient of x is inverse meters (m~*) and the coefficient on t is 
inverse seconds (s_‘). Find its momentum and energy. 


Glossary 


SchrOdinger’s time-dependent equation 
equation in space and time that allows us to determine wave functions 
of a quantum particle 


SchrOdinger’s time-independent equation 
equation in space that allows us to determine wave functions of a 
quantum particle; this wave function must be multiplied by a time- 
modulation factor to obtain the time-dependent wave function 


time-modulation factor 
factor e** that multiplies the time-independent wave function when 
the potential energy of the particle is time independent 


The Quantum Particle in a Box 
By the end of this section, you will be able to: 


¢ Describe how to set up a boundary-value problem for the stationary Schrodinger equation 

e Explain why the energy of a quantum particle in a box is quantized 

¢ Describe the physical meaning of stationary solutions to SchrOdinger’s equation and the 
connection of these solutions with time-dependent quantum states 

e Explain the physical meaning of Bohr’s correspondence principle 


In this section, we apply SchrOdinger’s equation to a particle bound to a one-dimensional box. 
This special case provides lessons for understanding quantum mechanics in more complex 
systems. The energy of the particle is quantized as a consequence of a standing wave condition 
inside the box. 


Consider a particle of mass m that is allowed to move only along the x-direction and its motion 
is confined to the region between hard and rigid walls located at x = 0 and at x = L ([link)). 
Between the walls, the particle moves freely. This physical situation is called the infinite 
square well, described by the potential energy function 

Equation: 


0. 0<@< LZ, 
oo, otherwise. 


U(x) = { 


Combining this equation with Schrddinger’s time-independent wave equation gives 


Note: 
Equation: 


=H P(a) 


2m dx? = Ey(z),forO0<2<L 
x 


where E is the total energy of the particle. What types of solutions do we expect? The energy 
of the particle is a positive number, so if the value of the wave function is positive (right side 
of the equation), the curvature of the wave function is negative, or concave down (left side of 
the equation). Similarly, if the value of the wave function is negative (right side of the 
equation), the curvature of the wave function is positive or concave up (left side of equation). 
This condition is met by an oscillating wave function, such as a sine or cosine wave. Since 
these waves are confined to the box, we envision standing waves with fixed endpoints at x = 0 
andz = L. 


The potential energy function that 
confines the particle in a one- 
dimensional box. 


Solutions ~(a) to this equation have a probabilistic interpretation. In particular, the square 
|~(x)|? represents the probability density of finding the particle at a particular location x. This 
function must be integrated to determine the probability of finding the particle in some interval 
of space. We are therefore looking for a normalizable solution that satisfies the following 
normalization condition: 

Equation: 


L 


[ aio? =1. 


0 


The walls are rigid and impenetrable, which means that the particle is never found beyond the 
wall. Mathematically, this means that the solution must vanish at the walls: 
Equation: 


We expect oscillating solutions, so the most general solution to this equation is 
Equation: 


w(x) = A, cos kx + By sin kx 
where k is the wave number, and A; and By are constants. Applying the boundary condition 


expressed by [link] gives 
Equation: 


Wx(0) = Axcos(k- 0) + Bysin(k- 0) = A; = 0. 


Because we have A; = 0, the solution must be 
Equation: 


Wx(x) = By sin ka. 


If By is zero, wx (x) = 0 for all values of x and the normalization condition, [link], cannot be 
satisfied. Assuming B; + 0, [link] for 2 = L then gives 
Equation: 


0 = Bysin(kL) => sin(kL) = 0 => kL = nz,n = 1,2,3.... 


We discard the n = 0 solution because (2) for this quantum number would be zero 
everywhere—an un-normalizable and therefore unphysical solution. Substituting [link] into 
[link] gives 
Equation: 
he a 
—— ——(B,sin (kax)) = E (By,sin (kz)). 
2m dx? 


Computing these derivatives leads to 
Equation: 


According to de Broglie, p = hk, so this expression implies that the total energy is equal to the 
kinetic energy, consistent with our assumption that the “particle moves freely.” Combining the 
results of [link] and [link] gives 


Note: 
Equation: 


Strange! A particle bound to a one-dimensional box can only have certain discrete (quantized) 
values of energy. Further, the particle cannot have a zero kinetic energy—it is impossible for a 


particle bound to a box to be “at rest.” 


To evaluate the allowed wave functions that correspond to these energies, we must find the 
normalization constant B,,. We impose the normalization condition [link] on the wave function 
Equation: 


Un(x) = Bysinnra/L 


Equation: 
z ZL i j ; 
l= [ atbaeP = [ exBisin? = Bf ae sin?" = Be— > B,=4/ =. 
L L 2 L 
0 0 


Hence, the wave functions that correspond to the energy values given in [link] are 


Note: 
Equation: 


a 
n(x) = ie sin —= jn =1,2,3,... 


For the lowest energy state or ground state energy, we have 


Equation: 
mh? 2 _ (Wx 
Ey = mE?’ (x) => /2 sin (=). 


All other energy states can be expressed as 


Equation: 
E, =n E,,%,(2) = 4/ — sin ( ) 


The index n is called the energy quantum number or principal quantum number. The state 
for n = 2 is the first excited state, the state for n = 3 is the second excited state, and so on. 
The first three quantum states (form = 1,2, and 3) of a particle in a box are shown in [link]. 


The wave functions in [link] are sometimes referred to as the “states of definite energy.” 
Particles in these states are said to occupy energy levels, which are represented by the 
horizontal lines in [link]. Energy levels are analogous to rungs of a ladder that the particle can 
“climb” as it gains or loses energy. 


The wave functions in [link] are also called stationary states and standing wave states. These 
functions are “stationary,” because their probability density functions, | (zx, t)|*, do not vary 
in time, and “standing waves” because their real and imaginary parts oscillate up and down 
like a standing wave—like a rope waving between two children on a playground. Stationary 
states are states of definite energy [[link]], but linear combinations of these states, such as 
w (x) = ay + ba (also solutions to Schr6dinger’s equation) are states of mixed energy. 

E 


Q7r2h2 
2mL2 


Arh? 
2mL2 


(a) (b) 


The first three quantum states of a quantum particle in a box for principal quantum 
numbers n = 1,2, and 3: (a) standing wave solutions and (b) allowed energy states. 


Energy quantization is a consequence of the boundary conditions. If the particle is not confined 
to a box but wanders freely, the allowed energies are continuous. However, in this case, only 
certain energies (#1, 4, 9F4,...) are allowed. The energy difference between adjacent 
energy levels is given by 

Equation: 


AEnsin = Ensi — En = (n+1)°Ey — 7 Ey = (2n+1)F\. 


Conservation of energy demands that if the energy of the system changes, the energy 
difference is carried in some other form of energy. For the special case of a charged particle 
confined to a small volume (for example, in an atom), energy changes are often carried away 
by photons. The frequencies of the emitted photons give us information about the energy 
differences (spacings) of the system and the volume of containment—the size of the “box” [see 
[link]. 


Example: 

A Simple Model of the Nucleus 

Suppose a proton is confined to a box of width L = 1.00 x 10~'*m (a typical nuclear 
radius). What are the energies of the ground and the first excited states? If the proton makes a 
transition from the first excited state to the ground state, what are the energy and the 
frequency of the emitted photon? 

Strategy 

If we assume that the proton confined in the nucleus can be modeled as a quantum particle in 
a box, all we need to do is to use [link] to find its energies &, and EK. The mass of a proton is 
m = 1.76 x 10-2’ kg. The emitted photon carries away the energy difference 

AE = E» — Ej. We can use the relation Ey = hf to find its frequency f. 

Solution 

The ground state: 

Equation: 


2x2 27. 10-43. s)° 
= we == ae ee — 3.28 x 10-833 = 2.05 MeV. 
2mL* 91.67 x 107-2” kg) (1.00 x 10~!4m) 


1 


The first excited state: Hy = 27, = 4(2.05 MeV) = 8.20 MeV. 
The energy of the emitted photon is 

Es = AE = E2 — E, = 8.20 MeV — 2.05 MeV = 6.15 MeV. 
The frequency of the emitted photon is 

Equation: 


E 15M 
fae ee eae x 107! Hz. 
h 4.14 x 10°*'MeV <s 


Significance 
This is the typical frequency of a gamma ray emitted by a nucleus. The energy of this photon 
is about 10 million times greater than that of a visible light photon. 


The expectation value of the position for a particle in a box is given by 


Equation: 


L L L 


(2) = 7 dxip, (x) xn (x) = death, (x)|? = / dex - Oy LL 


0 0 0 


We can also find the expectation value of the momentum or average momentum of a large 
number of particles in a given state: 
Equation: 


) = / de, (c) |-ih vale) 


L —s L 
= nme | d_ 2 os nTe _ 2 fh : NTa Nx 
aye any)? a a e/F sin 2 | - Bf de sin ™§ [cos 
0 
L 21m 
_ jah 1g in 21m _ _-nth L : —_ _ ph pn 
= [a ns Ss = —-147 oar dpsiny = —i5, -0=0. 
0 0 


Thus, for a particle in a state of definite energy, the average position is in the middle of the box 
and the average momentum of the particle is zero—as it would also be for a classical particle. 
Note that while the minimum energy of a classical particle can be zero (the particle can be at 
rest in the middle of the box), the minimum energy of a quantum particle is nonzero and given 
by [link]. The average particle energy in the nth quantum state—its expectation value of 
energy—is 

Equation: 


The result is not surprising because the standing wave state is a state of definite energy. Any 
energy measurement of this system must return a value equal to one of these allowed energies. 


Our analysis of the quantum particle in a box would not be complete without discussing Bohr’s 
correspondence principle. This principle states that for large quantum numbers, the laws of 
quantum physics must give identical results as the laws of classical physics. To illustrate how 
this principle works for a quantum particle in a box, we plot the probability density distribution 
Equation: 


Yn(z) ? 


2. 9 
oa ie 
= —sin*(nnz/L) 


for finding the particle around location x between the walls when the particle is in quantum 
state ~,,. [link] shows these probability distributions for the ground state, for the first excited 
state, and for a highly excited state that corresponds to a large quantum number. We see from 
these plots that when a quantum particle is in the ground state, it is most likely to be found 
around the middle of the box, where the probability distribution has the largest value. This is 
not so when the particle is in the first excited state because now the probability distribution has 
the zero value in the middle of the box, so there is no chance of finding the particle there. 
When a quantum particle is in the first excited state, the probability distribution has two 
maxima, and the best chance of finding the particle is at positions close to the locations of 
these maxima. This quantum picture is unlike the classical picture. 


YF 


IYI? 


~ANMMOOAAANNNNNNN 


The probability density distribution |7),,(x)|? for a quantum particle in a box for: (a) the 
ground state, n = 1; (b) the first excited state, n = 2; and, (c) the nineteenth excited 
state, 2 = 20. 


[=] 


cL 


The probability density of finding a classical particle between x and x + Az depends on how 
much time At the particle spends in this region. Assuming that its speed u is constant, this 
time is At = Az/u, which is also constant for any location between the walls. Therefore, the 


probability density of finding the classical particle at x is uniform throughout the box, and 
there is no preferable location for finding a classical particle. This classical picture is matched 
in the limit of large quantum numbers. For example, when a quantum particle is in a highly 
excited state, shown in [link], the probability density is characterized by rapid fluctuations and 
then the probability of finding the quantum particle in the interval Az does not depend on 
where this interval is located between the walls. 


Example: 

A Classical Particle in a Box 

A small 0.40-kg cart is moving back and forth along an air track between two bumpers located 
2.0 m apart. We assume no friction; collisions with the bumpers are perfectly elastic so that 
between the bumpers, the car maintains a constant speed of 0.50 m/s. Treating the cart as a 
quantum particle, estimate the value of the principal quantum number that corresponds to its 
classical energy. 

Strategy 

We find the kinetic energy K of the cart and its ground state energy /; as though it were a 
quantum particle. The energy of the cart is completely kinetic, so K = n? F, ([link]). Solving 
for n gives n = (K/E,)"/. 

Solution 

The kinetic energy of the cart is 

Equation: 


1 1 
K= smu = 5 (0.40 kg)(0.50 m/s)” = 0.050 J. 


The ground state of the cart, treated as a quantum particle, is 
Equation: 


2A2 (1.05 x 10-4J- s)” 
Wao es S) _ 4.700 « 10-85. 


Ei, = 2 2 
2mL 2(0.40 kg)(2.0 m) 


Therefore, n = (K/E,)'/? = (0.050/1.700 x 10-88) /? a2 10, 

Significance 

We see from this example that the energy of a classical system is characterized by a very large 
quantum number. Bohr’s correspondence principle concerns this kind of situation. We can 
apply the formalism of quantum mechanics to any kind of system, quantum or classical, and 
the results are correct in each case. In the limit of high quantum numbers, there is no 
advantage in using quantum formalism because we can obtain the same results with the less 
complicated formalism of classical mechanics. However, we cannot apply classical formalism 
to a quantum system in a low-number energy state. 


Note: 


Exercise: 


Problem: 


Check Your Understanding (a) Consider an infinite square well with wall boundaries 
x = Oand x = L. What is the probability of finding a quantum particle in its ground 
state somewhere between x = 0 and x = L/4? (b) Repeat question (a) for a classical 
particle. 


Solution: 


a. 9.1%; b. 25% 


Having found the stationary states w,,(a) and the energies E,, by solving the time-independent 
SchrGdinger equation [link], we use [link] to write wave functions Y,,(z, ¢) that are solutions 
of the time-dependent Schrddinger’s equation given by [link]. For a particle in a box this gives 
Equation: 


W,(2,t) =e "sb, (2) = | Zentzun sin —n a te Re ee 


where the energies are given by [link]. 


The quantum particle in a box model has practical applications in a relatively newly emerged 
field of optoelectronics, which deals with devices that convert electrical signals into optical 
signals. This model also deals with nanoscale physical phenomena, such as a nanoparticle 
trapped in a low electric potential bounded by high-potential barriers. 


Summary 


e Energy states of a quantum particle in a box are found by solving the time-independent 
SchrOdinger equation. 

¢ To solve the time-independent Schrddinger equation for a particle in a box and find the 
stationary states and allowed energies, we require that the wave function terminate at the 
box wall. 

e Energy states of a particle in a box are quantized and indexed by principal quantum 
number. 

e The quantum picture differs significantly from the classical picture when a particle is in a 
low-energy state of a low quantum number. 

e In the limit of high quantum numbers, when the quantum particle is in a highly excited 
state, the quantum description of a particle in a box coincides with the classical 
description, in the spirit of Bohr’s correspondence principle. 


Conceptual Questions 


Exercise: 
Problem: 
Using the quantum particle in a box model, describe how the possible energies of the 
particle are related to the size of the box. 
Exercise: 
Problem: 
Is it possible that when we measure the energy of a quantum particle in a box, the 


measurement may return a smaller value than the ground state energy? What is the 
highest value of the energy that we can measure for this particle? 


Solution: 


No. For an infinite square well, the spacing between energy levels increases with the 
quantum number n. The smallest energy measured corresponds to the transition from n = 
2 to 1, which is three times the ground state energy. The largest energy measured 
corresponds to a transition from n = oo to 1, which is infinity. (Note: Even particles with 
extremely large energies remain bound to an infinite square well—they can never 
“escape”) 


Exercise: 
Problem: 
For a quantum particle in a box, the first excited state (W2) has zero value at the midpoint 
position in the box, so that the probability density of finding a particle at this point is 
exactly zero. Explain what is wrong with the following reasoning: “If the probability of 
finding a quantum particle at the midpoint is zero, the particle is never at this point, right? 


How does it come then that the particle can cross this point on its way from the left side to 
the right side of the box? 


Problems 


Exercise: 


Problem: 
Assume that an electron in an atom can be treated as if it were confined to a box of width 
2.0 A. What is the ground state energy of the electron? Compare your result to the ground 


state kinetic energy of the hydrogen atom in the Bohr’s model of the hydrogen atom. 


Solution: 


9.4 eV, 64% 


Exercise: 
Problem: 
Assume that a proton in a nucleus can be treated as if it were confined to a one- 
dimensional box of width 10.0 fm. (a) What are the energies of the proton when it is in 
the states corresponding ton = 1, n = 2, and n = 3? (b) What are the energies of the 


photons emitted when the proton makes the transitions from the first and second excited 
states to the ground state? 


Exercise: 
Problem: 


An electron confined to a box has the ground state energy of 2.5 eV. What is the width of 
the box? 


Solution: 


0.38 nm 
Exercise: 
Problem: 
What is the ground state energy (in eV) of a proton confined to a one-dimensional box the 
size of the uranium nucleus that has a radius of approximately 15.0 fm? 
Exercise: 
Problem: 


What is the ground state energy (in eV) of an a-particle confined to a one-dimensional 
box the size of the uranium nucleus that has a radius of approximately 15.0 fm? 


Solution: 


1.82 MeV 
Exercise: 
Problem: 
To excite an electron in a one-dimensional box from its first excited state to its third 
excited state requires 20.0 eV. What is the width of the box? 
Exercise: 
Problem: 
An electron confined to a box of width 0.15 nm by infinite potential energy barriers emits 


a photon when it makes a transition from the first excited state to the ground state. Find 
the wavelength of the emitted photon. 


Solution: 


24.7 nm 
Exercise: 
Problem: 
If the energy of the first excited state of the electron in the box is 25.0 eV, what is the 
width of the box? 
Exercise: 
Problem: 


Suppose an electron confined to a box emits photons. The longest wavelength that is 
registered is 500.0 nm. What is the width of the box? 


Solution: 


6.03 A 
Exercise: 


Problem: 


Hydrogen Hz molecules are kept at 300.0 K in a cubical container with a side length of 
20.0 cm. Assume that you can treat the molecules as though they were moving in a one- 
dimensional box. (a) Find the ground state energy of the hydrogen molecule in the 
container. (b) Assume that the molecule has a thermal energy given by kgT'/2 and find 
the corresponding quantum number n of the quantum state that would correspond to this 
thermal energy. 


Exercise: 
Problem: 
An electron is confined to a box of width 0.25 nm. (a) Draw an energy-level diagram 
representing the first five states of the electron. (b) Calculate the wavelengths of the 
emitted photons when the electron makes transitions between the fourth and the second 


excited states, between the second excited state and the ground state, and between the 
third and the second excited states. 


Solution: 


a. 


Energy (meV) 


b. A553 = 12.9 nm, A3.4; = 25.8 nm, A443 = 29.4nm 
Exercise: 
Problem: 
An electron in a box is in the ground state with energy 2.0 eV. (a) Find the width of the 
box. (b) How much energy is needed to excite the electron to its first excited state? (c) If 


the electron makes a transition from an excited state to the ground state with the 
simultaneous emission of 30.0-eV photon, find the quantum number of the excited state? 


Glossary 


energy levels 
states of definite energy, often represented by horizontal lines in an energy “ladder” 
diagram 


energy quantum number 
index that labels the allowed energy states 


ground state energy 
lowest energy state in the energy spectrum 


infinite square well 
potential function that is zero in a fixed range and infinitely beyond this range 


principal quantum number 
energy quantum number 


standing wave state 
stationary state for which the real and imaginary parts of Y(, t) oscillate up and down 
like a standing wave (often modeled with sine and cosine functions) 


stationary state 
state for which the probability density function, |W (2, t)|”, does not vary in time 


The Quantum Harmonic Oscillator 
By the end of this section, you will be able to: 


e Describe the model of the quantum harmonic oscillator 

¢ Identify differences between the classical and quantum models of the harmonic 
oscillator 

e Explain physical situations where the classical and the quantum models coincide 


Oscillations are found throughout nature, in such things as electromagnetic waves, 
vibrating molecules, and the gentle back-and-forth sway of a tree branch. In previous 
chapters, we used Newtonian mechanics to study macroscopic oscillations, such as a 
block on a spring and a simple pendulum. In this chapter, we begin to study oscillating 
systems using quantum mechanics. We begin with a review of the classic harmonic 
oscillator. 


The Classic Harmonic Oscillator 


A simple harmonic oscillator is a particle or system that undergoes harmonic motion 
about an equilibrium position, such as an object with mass vibrating on a spring. In this 
section, we consider oscillations in one-dimension only. Suppose a mass moves back- 
and-forth along the 


x-direction about the equilibrium position, x = 0. In classical mechanics, the particle 
moves in response to a linear restoring force given by F°, = —kz, where x is the 
displacement of the particle from its equilibrium position. The motion takes place 
between two turning points, = +A, where A denotes the amplitude of the motion. The 


position of the object varies periodically in time with angular frequency w = / k/m, 
which depends on the mass m of the oscillator and on the force constant k of the net 
force, and can be written as 

Equation: 


z(t) = Acos(wt+¢). 


The total energy E of an oscillator is the sum of its kinetic energy K = mu”/2 and the 
elastic potential energy of the force U(x) = kx?/2, 
Equation: 


1 1 
E = —mv? + —ke?. 
2 z 2 


At turning points x = +A, the speed of the oscillator is zero; therefore, at these points, 
the energy of oscillation is solely in the form of potential energy E = k A?/2. The plot 


of the potential energy U(x) of the oscillator versus its position x is a parabola ([link]). 
The potential-energy function is a quadratic function of x, measured with respect to the 
equilibrium position. On the same graph, we also plot the total energy E of the oscillator, 
as a horizontal line that intercepts the parabola at x = +A. Then the kinetic energy K is 
represented as the vertical distance between the line of total energy and the potential 
energy parabola. 


—-A 0 +A 


The potential energy well of a classical harmonic oscillator: The motion is 
confined between turning points at x = —A and at x = +A. The energy of 
oscillations is E = kA?/2. 


In this plot, the motion of a classical oscillator is confined to the region where its kinetic 
energy is nonnegative, which is what the energy relation [link] says. Physically, it means 
that a classical oscillator can never be found beyond its turning points, and its energy 
depends only on how far the turning points are from its equilibrium position. The energy 
of a classical oscillator changes in a continuous way. The lowest energy that a classical 
oscillator may have is zero, which corresponds to a situation where an object is at rest at 
its equilibrium position. The zero-energy state of a classical oscillator simply means no 
oscillations and no motion at all (a classical particle sitting at the bottom of the potential 


well in [link]). When an object oscillates, no matter how big or small its energy may be, 
it spends the longest time near the turning points, because this is where it slows down 
and reverses its direction of motion. Therefore, the probability of finding a classical 
oscillator between the turning points is highest near the turning points and lowest at the 
equilibrium position. (Note that this is not a statement of preference of the object to go to 
lower energy. It is a statement about how quickly the object moves through various 
regions.) 


The Quantum Harmonic Oscillator 


One problem with this classical formulation is that it is not general. We cannot use it, for 
example, to describe vibrations of diatomic molecules, where quantum effects are 
important. A first step toward a quantum formulation is to use the classical expression 

k = mw? to limit mention of a “spring” constant between the atoms. In this way the 
potential energy function can be written in a more general form, 


Note: 
Equation: 


Combining this expression with the time-independent Schrddinger equation gives 


Note: 
Equation: 


i OP 
5- ee + = mush) = Ey(2). 


To solve [link]—that is, to find the allowed energies E and their corresponding wave 
functions (a2) —we require the wave functions to be symmetric about x = 0 (the 
bottom of the potential well) and to be normalizable. These conditions ensure that the 
probability density |4(a)|? must be finite when integrated over the entire range of x from 


—oo to +00. How to solve [link] is the subject of a more advanced course in quantum 
mechanics; here, we simply cite the results. The allowed energies are 


Note: 
Equation: 


The wave functions that correspond to these energies (the stationary states or states of 
definite energy) are 


Note: 
Equation: 


Wn(z) = Nne?* HH, (Bx), n =0,1,2,3,... 


where 3 = of mw/h, N,, is the normalization constant, and H,,(y) is a polynomial of 
degree n called a Hermite polynomial. The first four Hermite polynomials are 
Equation: 


Ao (y) =1 

Hy (y) = 2y 

Hp (y) = 4y’ — 2 
H3 (y) = 8y* — 12y 


A few sample wave functions are given in [link]. As the value of the principal number 
increases, the solutions alternate between even functions and odd functions about x = 0. 


Energy (in units of hv0) 


The first five wave functions of the quantum harmonic 
oscillator. The classical limits of the oscillator’s motion are 
indicated by vertical lines, corresponding to the classical 
turning points at z = +A of a classical particle with the same 
energy as the energy of a quantum oscillator in the state 
indicated in the figure. 


Example: 

Classical Region of Harmonic Oscillations 

Find the amplitude A of oscillations for a classical oscillator with energy equal to the 
energy of a quantum oscillator in the quantum state n. 

Strategy 

To determine the amplitude A, we set the classical energy E = kx?/2 = mw? A?/2 
equal to &,, given by [link]. 

Solution 

We obtain 

Equation: 


2 2 2 1 
Ee Aly 2 A, = 4/ —z En = 4/ ime fas = yf (2n +1) ; 
mw? mwu2 » mw 


Significance 

As the quantum number n increases, the energy of the oscillator and therefore the 
amplitude of oscillation increases (for a fixed natural angular frequency. For large n, the 
amplitude is approximately proportional to the square root of the quantum number. 


Several interesting features appear in this solution. Unlike a classical oscillator, the 
measured energies of a quantum oscillator can have only energy values given by [link]. 
Moreover, unlike the case for a quantum particle in a box, the allowable energy levels are 
evenly spaced, 

Equation: 


SnkD41, M44 
AB = By ~ By = FD Ry - ie ee es 


When a particle bound to such a system makes a transition from a higher-energy state to 
a lower-energy state, the smallest-energy quantum carried by the emitted photon is 
necessarily hf. Similarly, when the particle makes a transition from a lower-energy state 
to a higher-energy state, the smallest-energy quantum that can be absorbed by the particle 
is hf. A quantum oscillator can absorb or emit energy only in multiples of this smallest- 
energy quantum. This is consistent with Planck’s hypothesis for the energy exchanges 
between radiation and the cavity walls in the blackbody radiation problem. 


Example: 

Vibrational Energies of the Hydrogen Chloride Molecule 

The HC] diatomic molecule consists of one chlorine atom and one hydrogen atom. 
Because the chlorine atom is 35 times more massive than the hydrogen atom, the 
vibrations of the HCl molecule can be quite well approximated by assuming that the Cl 
atom is motionless and the H atom performs harmonic oscillations due to an elastic 
molecular force modeled by Hooke’s law. The infrared vibrational spectrum measured 
for hydrogen chloride has the lowest-frequency line centered at f = 8.88 x 10'° Hz. 
What is the spacing between the vibrational energies of this molecule? What is the force 
constant k of the atomic bond in the HCI molecule? 

Strategy 

The lowest-frequency line corresponds to the emission of lowest-frequency photons. 
These photons are emitted when the molecule makes a transition between two adjacent 
vibrational energy levels. Assuming that energy levels are equally spaced, we use [link] 


to estimate the spacing. The molecule is well approximated by treating the Cl atom as 
being infinitely heavy and the H atom as the mass m that performs the oscillations. 
Treating this molecular system as a classical oscillator, the force constant is found from 
the classical relation k = mw?. 

Solution 

The energy spacing is 

Equation: 


AE=hf =(4.14 x 10-%eV-s)(8.88 x 101? Hz) = 0.368 eV. 


The force constant is 
Equation: 


k= mw? = m(2rf)? = (1.67 x 1072 kg)(2m x 8.88 x 10! Hz)” = 520N/m. 


Significance 

The force between atoms in an HCl molecule is surprisingly strong. The typical energy 
released in energy transitions between vibrational levels is in the infrared range. As we 
will see later, transitions in between vibrational energy levels of a diatomic molecule 
often accompany transitions between rotational energy levels. 


Note: 
Exercise: 


Problem: 


Check Your Understanding The vibrational frequency of the hydrogen iodide HI 
diatomic molecule is 6.69 x 101% Hz. (a) What is the force constant of the 
molecular bond between the hydrogen and the iodine atoms? (b) What is the energy 
of the emitted photon when this molecule makes a transition between adjacent 
vibrational energy levels? 


Solution: 


a. 295 N/m; b. 0.277 eV 


The quantum oscillator differs from the classic oscillator in three ways: 


First, the ground state of a quantum oscillator is Hy = fAw/2, not zero. In the classical 
view, the lowest energy is zero. The nonexistence of a zero-energy state is common for 


all quantum-mechanical systems because of omnipresent fluctuations that are a 
consequence of the Heisenberg uncertainty principle. If a quantum particle sat motionless 
at the bottom of the potential well, its momentum as well as its position would have to be 
simultaneously exact, which would violate the Heisenberg uncertainty principle. 
Therefore, the lowest-energy state must be characterized by uncertainties in momentum 
and in position, so the ground state of a quantum particle must lie above the bottom of 
the potential well. 


Second, a particle in a quantum harmonic oscillator potential can be found with nonzero 
probability outside the interval —A < # < +A. Ina classic formulation of the problem, 
the particle would not have any energy to be in this region. The probability of finding a 
ground-state quantum particle in the classically forbidden region is about 16%. 


Third, the probability density distributions |7),(a)|? for a quantum oscillator in the 
ground low-energy state, w(x), is largest at the middle of the well (a = 0). For the 
particle to be found with greatest probability at the center of the well, we expect that the 
particle spends the most time there as it oscillates. This is opposite to the behavior of a 
classical oscillator, in which the particle spends most of its time moving with relative 
small speeds near the turning points. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Find the expectation value of the position for a 
particle in the ground state of a harmonic oscillator using symmetry. 


Solution: 


(2) = 


Quantum probability density distributions change in character for excited states, 
becoming more like the classical distribution when the quantum number gets higher. We 
observe this change already for the first excited state of a quantum oscillator because the 
distribution |, (a)|? peaks up around the turning points and vanishes at the equilibrium 
position, as seen in [link]. In accordance with Bohr’s correspondence principle, in the 
limit of high quantum numbers, the quantum description of a harmonic oscillator 
converges to the classical description, which is illustrated in [link]. The classical 
probability density distribution corresponding to the quantum energy of the n = 12 state 


is areasonably good approximation of the quantum probability distribution for a 
quantum oscillator in this excited state. This agreement becomes increasingly better for 


highly excited states. 


a 


The probability density distribution for finding the quantum harmonic oscillator in 
its n = 12 quantum state. The dashed curve shows the probability density 
distribution of a classical oscillator with the same energy. 


Summary 


e The quantum harmonic oscillator is a model built in analogy with the model of a 
classical harmonic oscillator. It models the behavior of many physical systems, such 
as molecular vibrations or wave packets in quantum optics. 

¢ The allowed energies of a quantum oscillator are discrete and evenly spaced. The 
energy spacing is equal to Planck’s energy quantum. 

e The ground state energy is larger than zero. This means that, unlike a classical 
oscillator, a quantum oscillator is never at rest, even at the bottom of a potential 


well, and undergoes quantum fluctuations. 

e The stationary states (states of definite energy) have nonzero values also in regions 
beyond classical turning points. When in the ground state, a quantum oscillator is 
most likely to be found around the position of the minimum of the potential well, 
which is the least-likely position for a classical oscillator. 

e For high quantum numbers, the motion of a quantum oscillator becomes more 
similar to the motion of a classical oscillator, in accordance with Bohr’s 
correspondence principle. 


Conceptual Questions 


Exercise: 
Problem: 


Is it possible to measure energy of 0.75hw for a quantum harmonic oscillator? 
Why? Why not? Explain. 


Solution: 


No. This energy corresponds to n = 0.25, but n must be an integer. 
Exercise: 
Problem: 
Explain the connection between Planck’s hypothesis of energy quanta and the 
energies of the quantum harmonic oscillator. 
Exercise: 
Problem: 


If a classical harmonic oscillator can be at rest, why can the quantum harmonic 
oscillator never be at rest? Does this violate Bohr’s correspondence principle? 


Solution: 


Because the smallest allowed value of the quantum number n for a simple harmonic 
oscillator is 0. No, because quantum mechanics and classical mechanics agree only 
in the limit of large n. 


Exercise: 
Problem: 


Use an example of a quantum particle in a box or a quantum oscillator to explain the 
physical meaning of Bohr’s correspondence principle. 


Exercise: 


Problem: 


Can we simultaneously measure position and energy of a quantum oscillator? Why? 
Why not? 


Solution: 


Yes, within the constraints of the uncertainty principle. If the oscillating particle is 
localized, the momentum and therefore energy of the oscillator are distributed. 


Problems 


Exercise: 


Problem: 


Show that the two lowest energy states of the simple harmonic oscillator, #9 (a) and 
w(x) from [link], satisfy [link]. 


Solution: 


proof 
Exercise: 
Problem: 
If the ground state energy of a simple harmonic oscillator is 1.25 eV, what is the 
frequency of its motion? 
Exercise: 
Problem: 


When a quantum harmonic oscillator makes a transition from the (n + 1) state to 
the n state and emits a 450-nm photon, what is its frequency? 


Solution: 


6.662 x 10!4 Hz 


Exercise: 


Problem: 


Vibrations of the hydrogen molecule Hz can be modeled as a simple harmonic 
oscillator with the spring constant k = 1.13 x 10°N/m and mass 


m = 1.67 x 10-2’ kg. (a) What is the vibrational frequency of this molecule? (b) 
What are the energy and the wavelength of the emitted photon when the molecule 
makes transition between its third and second excited states? 


Exercise: 


Problem: 


A particle with mass 0.030 kg oscillates back-and-forth on a spring with frequency 
4.0 Hz. At the equilibrium position, it has a speed of 0.60 m/s. If the particle is in a 
state of definite energy, find its energy quantum number. 


Solution: 


n = 2.037 x 10°° 
Exercise: 


Problem: 


Find the expectation value (o*) of the square of the position for a quantum 


+00 
ee _ = 
harmonic oscillator in the ground state. Note: / dxx?e—**” = ./n(2a3/2) 
—Co 
Exercise: 
Problem: 


Determine the expectation value of the potential energy for a quantum harmonic 
oscillator in the ground state. Use this to calculate the expectation value of the 
kinetic energy. 


Solution: 


(x) = 0.5mw? (2?) = fw/4; (K) = (E) — (U) = fiw/4 
Exercise: 
Problem: 
Verify that ~1(x) given by [link] is a solution of Schr6dinger’s equation for the 
quantum harmonic oscillator. 


Exercise: 


Problem: 


Estimate the ground state energy of the quantum harmonic oscillator by 
Heisenberg’s uncertainty principle. Start by assuming that the product of the 
uncertainties Az and Ap is at its minimum. Write Ap in terms of Az and assume 
that for the ground state  ~ Az and p = Ap, then write the ground state energy in 
terms of x. Finally, find the value of x that minimizes the energy and find the 
minimum of the energy. 


Solution: 


proof 
Exercise: 
Problem: 
A mass of 0.250 kg oscillates on a spring with the force constant 110 N/m. Calculate 


the ground energy level and the separation between the adjacent energy levels. 
Express the results in joules and in electron-volts. Are quantum effects important? 


The Quantum Tunneling of Particles through Potential Barriers 
By the end of this section, you will be able to: 


e Describe how a quantum particle may tunnel across a potential barrier 

e Identify important physical parameters that affect the tunneling probability 
e Identify the physical phenomena where quantum tunneling is observed 

e Explain how quantum tunneling is utilized in modern technologies 


Quantum tunneling is a phenomenon in which particles penetrate a potential energy barrier with a height greater 
than the total energy of the particles. The phenomenon is interesting and important because it violates the 
principles of classical mechanics. Quantum tunneling is important in models of the Sun and has a wide range of 
applications, such as the scanning tunneling microscope and the tunnel diode. 


Tunneling and Potential Energy 


To illustrate quantum tunneling, consider a ball rolling along a surface with a kinetic energy of 100 J. As the ball 
rolls, it encounters a hill. The potential energy of the ball placed atop the hill is 10 J. Therefore, the ball (with 100 J 
of kinetic energy) easily rolls over the hill and continues on. In classical mechanics, the probability that the ball 
passes over the hill is exactly 1—it makes it over every time. If, however, the height of the hill is increased—a ball 
placed atop the hill has a potential energy of 200 J—the ball proceeds only part of the way up the hill, stops, and 
returns in the direction it came. The total energy of the ball is converted entirely into potential energy before it can 
reach the top of the hill. We do not expect, even after repeated attempts, for the 100-J ball to ever be found beyond 
the hill. Therefore, the probability that the ball passes over the hill is exactly 0, and probability it is turned back or 
“reflected” by the hill is exactly 1. The ball never makes it over the hill. The existence of the ball beyond the hill is 
an impossibility or “energetically forbidden.” 


However, according to quantum mechanics, the ball has a wave function and this function is defined over all 
space. The wave function may be highly localized, but there is always a chance that as the ball encounters the hill, 
the ball will suddenly be found beyond it. Indeed, this probability is appreciable if the “wave packet” of the ball is 
wider than the barrier. 


Note: 
View this interactive simulation for a simulation of tunneling. 


In the language of quantum mechanics, the hill is characterized by a potential barrier. A finite-height square 
barrier is described by the following potential-energy function: 


Note: 
Equation: 
0, whenz <0 
U(x) = 4 Uo, when0<a<L 
0, whenz > L. 


The potential barrier is illustrated in [link]. When the height Uo of the barrier is infinite, the wave packet 
representing an incident quantum particle is unable to penetrate it, and the quantum particle bounces back from the 
barrier boundary, just like a classical particle. When the width L of the barrier is infinite and its height is finite, a 


part of the wave packet representing an incident quantum particle can filter through the barrier boundary and 
eventually perish after traveling some distance inside the barrier. 


U(x) 4 
Uy 4 Incident 
Transmitted 
——> 
—— 
Reflected 
— +-------------4--------4-------------- 


0 L 


A potential energy barrier of height Uo creates three 
physical regions with three different wave behaviors. In 
region I where x < 0, an incident wave packet (incident 

particle) moves in a potential-free zone and coexists with a 
reflected wave packet (reflected particle). In region II, a 
part of the incident wave that has not been reflected at 
x = 0 moves as a transmitted wave in a constant potential 
U(x) = +Up and tunnels through to region III at « = L. 

In region III for 2 > LD, a wave packet (transmitted 

particle) that has tunneled through the potential barrier 
moves as a free particle in potential-free zone. The energy 
E of the incident particle is indicated by the horizontal line. 


When both the width L and the height Uo are finite, a part of the quantum wave packet incident on one side of the 
barrier can penetrate the barrier boundary and continue its motion inside the barrier, where it is gradually 
attenuated on its way to the other side. A part of the incident quantum wave packet eventually emerges on the 
other side of the barrier in the form of the transmitted wave packet that tunneled through the barrier. How much of 
the incident wave can tunnel through a barrier depends on the barrier width L and its height Uo, and on the energy 
E of the quantum particle incident on the barrier. This is the physics of tunneling. 


Barrier penetration by quantum wave functions was first analyzed theoretically by Friedrich Hund in 1927, shortly 
after Schrédinger published the equation that bears his name. A year later, George Gamow used the formalism of 
quantum mechanics to explain the radioactive a-decay of atomic nuclei as a quantum-tunneling phenomenon. The 
invention of the tunnel diode in 1957 made it clear that quantum tunneling is important to the semiconductor 
industry. In modern nanotechnologies, individual atoms are manipulated using a knowledge of quantum tunneling. 


Tunneling and the Wave Function 


Suppose a uniform and time-independent beam of electrons or other quantum particles with energy E traveling 
along the x-axis (in the positive direction to the right) encounters a potential barrier described by [link]. The 
question is: What is the probability that an individual particle in the beam will tunnel through the potential barrier? 
The answer can be found by solving the boundary-value problem for the time-independent Schr6dinger equation 
for a particle in the beam. The general form of this equation is given by [link], which we reproduce here: 
Equation: 


d(x) 


2m dx? 


+ U(x)(x) = E(x), where — 00 < x < +00. 


In [link], the potential function U(x) is defined by [link]. We assume that the given energy E of the incoming 
particle is smaller than the height Uo of the potential barrier, H < Uo, because this is the interesting physical case. 
Knowing the energy E of the incoming particle, our task is to solve [link] for a function w(2) that is continuous 
and has continuous first derivatives for all x. In other words, we are looking for a “smooth-looking” solution 
(because this is how wave functions look) that can be given a probabilistic interpretation so that 

|o(x) |? = Y"(a)p(a) is the probability density. 


We divide the real axis into three regions with the boundaries defined by the potential function in [link] (illustrated 
in [link]) and transcribe [link] for each region. Denoting by 7;(z) the solution in region I for x < 0, by wy(z) the 
solution in region II for0 < x < LD, and by wyr(z) the solution in region III for z > L, the stationary 
Schr6dinger equation has the following forms in these three regions: 


Equation: 
h? a? 
ae. a tile) = Ey(z), in region I: — 00 < « <0, 
2m dx? 
Equation: 
hn? @? 
OU ca i Se) nae OSES: 
2m dx? 
Equation: 


_w dym(z) 


5 eae E(x), in region III: L < x < +00. 
m L 


The continuity condition at region boundaries requires that: 
Equation: 


(0) = Wn(0), at the boundary between regions I and II and 


and 
Equation: 


wn(L) = om(L), at the boundary between regions II and III. 
The “smoothness” condition requires the first derivative of the solution be continuous at region boundaries: 
Equation: 


dipi(z) 
dx 


_ dyn (z) 
dx 


, at the boundary between regions I and II; 
z=0 


z=0 


and 
Equation: 


diy (zx) 


dz 


dy (x) 


d , at the boundary between regions II and III. 
z=L x 


z=L 


In what follows, we find the functions W(x), Pu(«), and pm (x). 
We can easily verify (by substituting into the original equation and differentiating) that in regions I and III, the 
solutions must be in the following general forms: 
Equation: 
Te) = Aetike ae Be ik 
Equation: 
om(«) = Fett as Ge tk 


where k = V2mE /fh is a wave number and the complex exponent denotes oscillations, 
Equation: 


exkt — cogkx tisin ker. 


The constants A, B, F, and G in [link] and [link] may be complex. These solutions are illustrated in [link]. In 
region I, there are two waves—one is incident (moving to the right) and one is reflected (moving to the left)—so 
none of the constants A and B in [link] may vanish. In region III, there is only one wave (moving to the right), 
which is the transmitted wave, so the constant G must be zero in [link], G = 0. We can write explicitly that the 
incident wave is )i,(z) = Ae**** and that the reflected wave is Wye¢(x) = Be~**”, and that the transmitted wave 
is Wira(z) = Fe****. The amplitude of the incident wave is 

Equation: 


tbin(2t)|? = Vin (@)thin(t) = (Alt) “Act = A*e Aer = AA = | A)? 


Similarly, the amplitude of the reflected wave is |%,e¢(x)|” = |B|? and the amplitude of the transmitted wave is 


\Wtra(2)|? = |F'|?. We know from the theory of waves that the square of the wave amplitude is directly 
proportional to the wave intensity. If we want to know how much of the incident wave tunnels through the barrier, 
we need to compute the square of the amplitude of the transmitted wave. The transmission probability or 
tunneling probability is the ratio of the transmitted intensity (|F'|*) to the incident intensity (|A|?), written as 


Note: 
Equation: 


2 


a She E) = [Ptra(x)|? _ |F|? = = 


lvin(a)|? — |AP? | A 


where L is the width of the barrier and E is the total energy of the particle. This is the probability an individual 
particle in the incident beam will tunnel through the potential barrier. Intuitively, we understand that this 


probability must depend on the barrier height Up. 


In region II, the terms in equation [link] can be rearranged to 
Equation: 


d*yn(z) 


sa? = B'bn(a) 


where £3? is positive because Up > E and the parameter f is a real number, 


Note: 
Equation: 


2m 


Be =S ye Uo a E). 


The general solution to [link] is not oscillatory (unlike in the other regions) and is in the form of exponentials that 
describe a gradual attenuation of w(x), 
Equation: 

w(x) = Ce + Det. 


The two types of solutions in the three regions are illustrated in [link]. 
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Three types of solutions to the stationary Schrddinger 
equation for the quantum-tunneling problem: Oscillatory 
behavior in regions I and III where a quantum particle 
moves freely, and exponential-decay behavior in region II 
(the barrier region) where the particle moves in the 
potential Un. 


Now we use the boundary conditions to find equations for the unknown constants. [link] and [link] are substituted 
into [link] to give 
Equation: 


A+B=C+D. 


[link] and [link] are substituted into [link] to give 
Equation: 


Ce bt + DetbL = Fetikl, 


Similarly, we substitute [link] and [link] into [link], differentiate, and obtain 
Equation: 


~ik(A — B) = B(D —C). 


Similarly, the boundary condition [link] reads explicitly 
Equation: 


B (Det®" — Ce") = —ikFet*”, 


We now have four equations for five unknown constants. However, because the quantity we are after is the 
transmission coefficient, defined in [link] by the fraction F/A, the number of equations is exactly right because 
when we divide each of the above equations by A, we end up having only four unknown fractions: B/A, C/A, D/A, 
and F/A, three of which can be eliminated to find F/A. The actual algebra that leads to expression for F/A is pretty 
lengthy, but it can be done either by hand or with a help of computer software. The end result is 

Equation: 


F etkL 
A cosh (BL) + %(7/2)sinh (BL) 


In deriving [link], to avoid the clutter, we use the substitutions y = 6/k — k/B, 
Equation: 


ev te e’ —ed 


cosh y = 5 , and sinh y = 5 


We substitute [link] into [link] and obtain the exact expression for the transmission coefficient for the barrier, 
Equation: 


P\.F etikL e tkL 
T(L, E) = (=) ‘A cosh (BL) — i(7/2)sinh (BL) cosh (BL) + i(/2)sinh (BL) 


or 


Note: 


Equation: 
1 
cosh?(BL) + (y/2)’sinh? (BL) 


T(L, E) = 


where 
Equation: 


2 1—E/U E/U 
(3) =7( ath | a7 2). 


For a wide and high barrier that transmits poorly, [link] can be approximated by 


Note: 
Equation: 


E E 
180 8) = 1G 
(L, ) ral ae 


Whether it is the exact expression [link] or the approximate expression [link], we see that the tunneling effect very 
strongly depends on the width L of the potential barrier. In the laboratory, we can adjust both the potential height 
Up and the width L to design nano-devices with desirable transmission coefficients. 


Example: 

Transmission Coefficient 

Two copper nanowires are insulated by a copper oxide nano-layer that provides a 10.0-eV potential barrier. 
Estimate the tunneling probability between the nanowires by 7.00-eV electrons through a 5.00-nm thick oxide 
layer. What if the thickness of the layer were reduced to just 1.00 nm? What if the energy of electrons were 
increased to 9.00 eV? 

Strategy 

Treating the insulating oxide layer as a finite-height potential barrier, we use [link]. We identify Up = 10.0 eV, 
FE, = 7.00 eV, EF = 9.00 eV, Ly = 5.00 nm, and Lz = 1.00 nm. We use [link] to compute the exponent. Also, 
we need the rest mass of the electron m = 511 keV/c? and Planck’s constant & = 0.1973keV - nm/c. It is 
typical for this type of estimate to deal with very small quantities that are often not suitable for handheld 
calculators. To make correct estimates of orders, we make the conversion eY = 1027/20) 


Solution 

Constants: 

Equation: 
2 2(511 keV/c? 
—_ cenueewicd) ~ = 26,254, 
h (0.1973keV - nm/c) keV - (nm) 


Equation: 


(10.0eV—E 
keV - (nm)? 


B=4/—(U)—E) = 25.254 ee 1/26.254(10.0 eva ye 
nm 


For a lower-energy electron with #, = 7.00 eV: 
Equation: 


1 
By = J 26.254(10.00 eV — E,)/eV— = 1/26.254(10.00 7.00) , 
nm 


nm nm 


Equation: 


E E 
T(L, E;) = ig (1 = ae = tee (1 as Tew tiam = 3.36e717-75 L/nm | 
Uo Uo 10 10 


For a higher-energy electron with Fy = 9.00 eV: 
Equation: 


1 
Bo = J 26.254(10.00 eV — E2)/eV— = \/26.254(10.00 9.00) ; 
nm 


nm nm 


Equation: 


E E 
T(L, Ex) = 162 (1 — =2 ) e264 = 16 (1 — 2 \e20254/nm — 4 4gen1025 L/nm, 
Uo Uo 10 


For a broad barrier with LZ; = 5.00 nm: 
Equation: 


T (Ly, Ey) = 3.36e7177 ta/nm — 3 36¢717-755.00nm/nm _ 3 36¢—88 — 3.36(6.2 x 10-%) = 2.1% x 107%, 
Equation: 
T (Ly, E2) = 1.44e71075 21/nm _ 1 44 10-255.00nm/nm _ 7 gel? _ 1 44(5.81 x 107) = 8.36% x 10°? 


For a narrower barrier with Ly = 1.00 nm: 
Equation: 


T (Lo, E1) = 3.36e7 177 22/0 — 3.36¢717-75:1.00 nm/am _ 3 366-17 _ 3 36(5.1 x 1077) = 1.7% x 1074, 
Equation: 
T (Lz, Ex) = 1.44e71075 22/nm — 4 44e—10-251.00nm/nm _ 4 4g4e-1025 _ 4 44(3.53 x 1075) = 5.09% x 1077. 


Significance 

We see from these estimates that the probability of tunneling is affected more by the width of the potential barrier 
than by the energy of an incident particle. In today’s technologies, we can manipulate individual atoms on metal 
surfaces to create potential barriers that are fractions of a nanometer, giving rise to measurable tunneling currents. 
One of many applications of this technology is the scanning tunneling microscope (STM), which we discuss later 
in this section. 


Note: 
Exercise: 


Problem: 


Check Your Understanding A proton with kinetic energy 1.00 eV is incident on a square potential barrier 
with height 10.00 eV. If the proton is to have the same transmission probability as an electron of the same 
energy, what must the width of the barrier be relative to the barrier width encountered by an electron? 


Solution: 


We eocon) elec on = 4/me/Mp = DBM 


Radioactive Decay 


In 1928, Gamow identified quantum tunneling as the mechanism responsible for the radioactive decay of atomic 
nuclei. He observed that some isotopes of thorium, uranium, and bismuth disintegrate by emitting a-particles 
(which are doubly ionized helium atoms or, simply speaking, helium nuclei). In the process of emitting an a- 
particle, the original nucleus is transformed into a new nucleus that has two fewer neutrons and two fewer protons 
than the original nucleus. The a-particles emitted by one isotope have approximately the same kinetic energies. 
When we look at variations of these energies among isotopes of various elements, the lowest kinetic energy is 
about 4 MeV and the highest is about 9 MeV, so these energies are of the same order of magnitude. This is about 
where the similarities between various isotopes end. 


When we inspect half-lives (a half-life is the time in which a radioactive sample loses half of its nuclei due to 
decay), different isotopes differ widely. For example, the half-life of polonium-214 is 160 ps and the half-life of 
uranium is 4.5 billion years. Gamow explained this variation by considering a ‘spherical-box’ model of the 
nucleus, where a-particles can bounce back and forth between the walls as free particles. The confinement is 
provided by a strong nuclear potential at a spherical wall of the box. The thickness of this wall, however, is not 
infinite but finite, so in principle, a nuclear particle has a chance to escape this nuclear confinement. On the inside 
wall of the confining barrier is a high nuclear potential that keeps the a-particle in a small confinement. But when 
an a-particle gets out to the other side of this wall, it is subject to electrostatic Coulomb repulsion and moves away 
from the nucleus. This idea is illustrated in [link]. The width L of the potential barrier that separates an a-particle 
from the outside world depends on the particle’s kinetic energy E. This width is the distance between the point 
marked by the nuclear radius R and the point Ro where an a-particle emerges on the other side of the barrier, 
L = Ro — R. At the distance Ro, its kinetic energy must at least match the electrostatic energy of repulsion, 
b= (4ne9) Ze? / Ro (where + Ze is the charge of the nucleus). In this way we can estimate the width of the 
nuclear barrier, 
Equation: 
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We see from this estimate that the higher the energy of a-particle, the narrower the width of the barrier that it is to 
tunnel through. We also know that the width of the potential barrier is the most important parameter in tunneling 
probability. Thus, highly energetic a-particles have a good chance to escape the nucleus, and, for such nuclei, the 
nuclear disintegration half-life is short. Notice that this process is highly nonlinear, meaning a small increase in the 
a-particle energy has a disproportionately large enhancing effect on the tunneling probability and, consequently, on 
shortening the half-life. This explains why the half-life of polonium that emits 8-MeV a-particles is only hundreds 
of milliseconds and the half-life of uranium that emits 4-MeV a-particles is billions of years. 


U(r) 


E|--- ee 


The potential energy barrier for an a- 
particle bound in the nucleus: To escape 
from the nucleus, an a-particle with 
energy E must tunnel across the barrier 
from distance R to distance Ro away 
from the center. 


Field Emission 


Field emission is a process of emitting electrons from conducting surfaces due to a strong external electric field 
that is applied in the direction normal to the surface ([link]). As we know from our study of electric fields in earlier 
chapters, an applied external electric field causes the electrons in a conductor to move to its surface and stay there 
as long as the present external field is not excessively strong. In this situation, we have a constant electric potential 
throughout the inside of the conductor, including its surface. In the language of potential energy, we say that an 
electron inside the conductor has a constant potential energy U(x) = —Up (here, the x means inside the 
conductor). In the situation represented in [link], where the external electric field is uniform and has magnitude Ei, 
, if an electron happens to be outside the conductor at a distance x away from its surface, its potential energy would 
have to be U(x) = —eE gz (here, x denotes distance to the surface). Taking the origin at the surface, so that z = 0 
is the location of the surface, we can represent the potential energy of conduction electrons in a metal as the 
potential energy barrier shown in [link]. In the absence of the external field, the potential energy becomes a step 
barrier defined by U(x < 0) = —Up and by U(az > 0) = 0. 


A normal-direction external electric field at the 
surface of a conductor: In a strong field, the 
electrons on a conducting surface may get 
detached from it and accelerate against the 
external electric field away from the surface. 


The potential energy barrier at the surface of a metallic conductor in the 
presence of an external uniform electric field EZ, normal to the surface: It 
becomes a step-function barrier when the external field is removed. The work 
function of the metal is indicated by ¢. 


When an external electric field is strong, conduction electrons at the surface may get detached from it and 
accelerate along electric field lines in a direction antiparallel to the external field, away from the surface. In short, 
conduction electrons may escape from the surface. The field emission can be understood as the quantum tunneling 
of conduction electrons through the potential barrier at the conductor’s surface. The physical principle at work here 
is very similar to the mechanism of a-emission from a radioactive nucleus. 


Suppose a conduction electron has a kinetic energy E (the average kinetic energy of an electron in a metal is the 
work function ¢ for the metal and can be measured, as discussed for the photoelectric effect in Photons and Matter 
Waves), and an external electric field can be locally approximated by a uniform electric field of strength E’,. The 
width L of the potential barrier that the electron must cross is the distance from the conductor’s surface to the point 
outside the surface where its kinetic energy matches the value of its potential energy in the external field. In [link], 
this distance is measured along the dashed horizontal line U(x) = E from x = 0 to the intercept with 
U(x) = —eE x, so the barrier width is 
Equation: 

eik ¢'¢ 


L= = ‘ 
E, E, 


We see that L is inversely proportional to the strength E, of an external field. When we increase the strength of the 
external field, the potential barrier outside the conductor becomes steeper and its width decreases for an electron 
with a given kinetic energy. In turn, the probability that an electron will tunnel across the barrier (conductor 
surface) becomes exponentially larger. The electrons that emerge on the other side of this barrier form a current 
(tunneling-electron current) that can be detected above the surface. The tunneling-electron current is proportional 
to the tunneling probability. The tunneling probability depends nonlinearly on the barrier width L, and L can be 
changed by adjusting ,. Therefore, the tunneling-electron current can be tuned by adjusting the strength of an 
external electric field at the surface. When the strength of an external electric field is constant, the tunneling- 
electron current has different values at different elevations L above the surface. 


The quantum tunneling phenomenon at metallic surfaces, which we have just described, is the physical principle 
behind the operation of the scanning tunneling microscope (STM), invented in 1981 by Gerd Binnig and 
Heinrich Rohrer. The STM device consists of a scanning tip (a needle, usually made of tungsten, platinum-iridium, 
or gold); a piezoelectric device that controls the tip’s elevation in a typical range of 0.4 to 0.7 nm above the surface 
to be scanned; some device that controls the motion of the tip along the surface; and a computer to display images. 
While the sample is kept at a suitable voltage bias, the scanning tip moves along the surface ([link]), and the 
tunneling-electron current between the tip and the surface is registered at each position. The amount of the current 
depends on the probability of electron tunneling from the surface to the tip, which, in turn, depends on the 
elevation of the tip above the surface. Hence, at each tip position, the distance from the tip to the surface is 
measured by measuring how many electrons tunnel out from the surface to the tip. This method can give an 
unprecedented resolution of about 0.001 nm, which is about 1% of the average diameter of an atom. In this way, 
we can see individual atoms on the surface, as in the image of a carbon nanotube in [link]. 


STM tip 


Tunneling-electron 
current 


Surface atoms 
being 
scanned 


Computer monitor 


In STM, a surface at a constant potential is being scanned by a narrow tip 
moving along the surface. When the STM tip moves close to surface atoms, 
electrons can tunnel from the surface to the tip. This tunneling-electron current is 
continually monitored while the tip is in motion. The amount of current at 
location (x,y) gives information about the elevation of the tip above the surface at 
this location. In this way, a detailed topographical map of the surface is created 
and displayed on a computer monitor. 


An STM image of a carbon nanotube: Atomic-scale 
resolution allows us to see individual atoms on the 
surface. STM images are in gray scale, and coloring is 
added to bring up details to the human eye. (credit: 
Taner Yildirim, NIST) 


Resonant Quantum Tunneling 


Quantum tunneling has numerous applications in semiconductor devices such as electronic circuit components or 
integrated circuits that are designed at nanoscales; hence, the term ‘nanotechnology.’ For example, a diode (an 


electric-circuit element that causes an electron current in one direction to be different from the current in the 
opposite direction, when the polarity of the bias voltage is reversed) can be realized by a tunneling junction 
between two different types of semiconducting materials. In such a tunnel diode, electrons tunnel through a single 
potential barrier at a contact between two different semiconductors. At the junction, tunneling-electron current 
changes nonlinearly with the applied potential difference across the junction and may rapidly decrease as the bias 
voltage is increased. This is unlike the Ohm’s law behavior that we are familiar with in household circuits. This 
kind of rapid behavior (caused by quantum tunneling) is desirable in high-speed electronic devices. 


Another kind of electronic nano-device utilizes resonant tunneling of electrons through potential barriers that 
occur in quantum dots. A quantum dot is a small region of a semiconductor nanocrystal that is grown, for 
example, in a silicon or aluminum arsenide crystal. [link](a) shows a quantum dot of gallium arsenide embedded in 
an aluminum arsenide wafer. The quantum-dot region acts as a potential well of a finite height (shown in [link](b)) 
that has two finite-height potential barriers at dot boundaries. Similarly, as for a quantum particle in a box (that is, 
an infinite potential well), lower-lying energies of a quantum particle trapped in a finite-height potential well are 
quantized. The difference between the box and the well potentials is that a quantum particle in a box has an infinite 
number of quantized energies and is trapped in the box indefinitely, whereas a quantum particle trapped in a 
potential well has a finite number of quantized energy levels and can tunnel through potential barriers at well 
boundaries to the outside of the well. Thus, a quantum dot of gallium arsenide sitting in aluminum arsenide is a 
potential well where low-lying energies of an electron are quantized, indicated as got in part (b) in the figure. 
When the energy Eetectron of an electron in the outside region of the dot does not match its energy Ego that it 
would have in the dot, the electron does not tunnel through the region of the dot and there is no current through 
such a circuit element, even if it were kept at an electric voltage difference (bias). However, when this voltage bias 
is changed in such a way that one of the barriers is lowered, so that Fao, and Fetectron become aligned, as seen in 
part (c) of the figure, an electron current flows through the dot. When the voltage bias is now increased, this 
alignment is lost and the current stops flowing. When the voltage bias is increased further, the electron tunneling 
becomes improbable until the bias voltage reaches a value for which the outside electron energy matches the next 
electron energy level in the dot. The word ‘resonance’ in the device name means that the tunneling-electron current 
occurs only when a selected energy level is matched by tuning an applied voltage bias, such as in the operation 
mechanism of the resonant-tunneling diode just described. Resonant-tunneling diodes are used as super-fast 
nano-switches. 


quantum 
dot 


gallium aluminum 
arsenide arsenide 


(a) (b) (c) 


Resonant-tunneling diode: (a) A quantum dot of gallium arsenide embedded in aluminum arsenide. 
(b) Potential well consisting of two potential barriers of a quantum dot with no voltage bias. Electron 
energies Electron in aluminum arsenide are not aligned with their energy levels Fao; in the quantum 
dot, so electrons do not tunnel through the dot. (c) Potential well of the dot with a voltage bias across 
the device. A suitably tuned voltage difference distorts the well so that electron-energy levels in the 
dot are aligned with their energies in aluminum arsenide, causing the electrons to tunnel through the 

dot. 


Summary 


e A quantum particle that is incident on a potential barrier of a finite width and height may cross the barrier and 
appear on its other side. This phenomenon is called ‘quantum tunneling.’ It does not have a classical analog. 

e To find the probability of quantum tunneling, we assume the energy of an incident particle and solve the 
stationary Schr6dinger equation to find wave functions inside and outside the barrier. The tunneling 
probability is a ratio of squared amplitudes of the wave past the barrier to the incident wave. 

e The tunneling probability depends on the energy of the incident particle relative to the height of the barrier 
and on the width of the barrier. It is strongly affected by the width of the barrier in a nonlinear, exponential 
way so that a small change in the barrier width causes a disproportionately large change in the transmission 
probability. 

¢ Quantum-tunneling phenomena govern radioactive nuclear decays. They are utilized in many modern 
technologies such as STM and nano-electronics. STM allows us to see individual atoms on metal surfaces. 
Electron-tunneling devices have revolutionized electronics and allow us to build fast electronic devices of 
miniature sizes. 
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Conceptual Questions 


Exercise: 


Problem: 


When an electron and a proton of the same kinetic energy encounter a potential barrier of the same height and 
width, which one of them will tunnel through the barrier more easily? Why? 


Exercise: 


Problem: 


What decreases the tunneling probability most: doubling the barrier width or halving the kinetic energy of the 
incident particle? 


Solution: 
doubling the barrier width 


Exercise: 


Problem: Explain the difference between a box-potential and a potential of a quantum dot. 

Exercise: 
Problem: Can a quantum particle ‘escape’ from an infinite potential well like that in a box? Why? Why not? 
Solution: 


No, the restoring force on the particle at the walls of an infinite square well is infinity. 


Exercise: 


Problem: 


A tunnel diode and a resonant-tunneling diode both utilize the same physics principle of quantum tunneling. 
In what important way are they different? 


Problems 


Exercise: 


Problem: Show that the wave function in (a) [link] satisfies [link], and (b) [link] satisfies [link]. 
Solution: 


A complex function of the form, Ae’, satisfies Schr6dinger’s time-independent equation. The operators for 
kinetic and total energy are linear, so any linear combination of such wave functions is also a valid solution to 
Schr6dinger’s equation. Therefore, we conclude that [link] satisfies [link], and [link] satisfies [link]. 


Exercise: 
Problem: 
A 6.0-eV electron impacts on a barrier with height 11.0 eV. Find the probability of the electron to tunnel 
through the barrier if the barrier width is (a) 0.80 nm and (b) 0.40 nm. 
Exercise: 
Problem: 


A 5.0-eV electron impacts on a barrier of with 0.60 nm. Find the probability of the electron to tunnel through 
the barrier if the barrier height is (a) 7.0 eV; (b) 9.0 eV; and (c) 13.0 eV. 


Solution: 
a. 4.21%; b. 0.84%; c. 0.06% 


Exercise: 


Problem: 


A 12.0-eV electron encounters a barrier of height 15.0 eV. If the probability of the electron tunneling through 
the barrier is 2.5 %, find its width. 
Exercise: 
Problem: 
A quantum particle with initial kinetic energy 32.0 eV encounters a square barrier with height 41.0 eV and 


width 0.25 nm. Find probability that the particle tunnels through this barrier if the particle is (a) an electron 
and, (b) a proton. 


Solution: 


a. 0.13%; b. close to 0% 


Exercise: 


Problem: 


A simple model of a radioactive nuclear decay assumes that a-particles are trapped inside a well of nuclear 
potential that walls are the barriers of a finite width 2.0 fm and height 30.0 MeV. Find the tunneling 
probability across the potential barrier of the wall for a-particles having kinetic energy (a) 29.0 MeV and (b) 
20.0 MeV. The mass of the a-particle is m = 6.64 x 107?’ kg. 


Exercise: 
Problem: 
A muon, a quantum particle with a mass approximately 200 times that of an electron, is incident on a 


potential barrier of height 10.0 eV. The kinetic energy of the impacting muon is 5.5 eV and only about 0.10% 
of the squared amplitude of its incoming wave function filters through the barrier. What is the barrier’s width? 


Solution: 
0.38 nm 


Exercise: 
Problem: 
A grain of sand with mass 1.0 mg and kinetic energy 1.0 J is incident on a potential energy barrier with height 


1.000001 J and width 2500 nm. How many grains of sand have to fall on this barrier before, on the average, 
one passes through? 


Additional Problems 


Exercise: 
Problem: 


Show that if the uncertainty in the position of a particle is on the order of its de Broglie’s wavelength, then the 
uncertainty in its momentum is on the order of the value of its momentum. 


Solution: 


proof 
Exercise: 
Problem: 
The mass of a p-meson is measured to be 770 MeV/c? with an uncertainty of 100 MeV/c”. Estimate the 
lifetime of this meson. 
Exercise: 
Problem: 
A particle of mass m is confined to a box of width L. If the particle is in the first excited state, what are the 


probabilities of finding the particle in a region of width 0.020 L around the given point x: (a) « = 0.25L; (b) 
x = 0.40Z; (c) e = 0.750; and (d) c = 0.90L. 


Solution: 


a. 4.0 %; b. 1.4 %; c. 4.0%; d. 1.4% 


Exercise: 


Problem: A particle in a box [0;L] is in the third excited state. What are its most probable positions? 
Exercise: 

Problem: 

A 0.20-kg billiard ball bounces back and forth without losing its energy between the cushions of a 1.5 m long 

table. (a) If the ball is in its ground state, how many years does it need to get from one cushion to the other? 

You may compare this time interval to the age of the universe. (b) How much energy is required to make the 


ball go from its ground state to its first excited state? Compare it with the kinetic energy of the ball moving at 
2.0 m/s. 


Solution: 


at = mL?/h=2.15 x 10” years; b. FE) = 1.46 x 10°°J,K =0.4J 
Exercise: 
Problem: 
Find the expectation value of the position squared when the particle in the box is in its third excited state and 
the length of the box is L. 
Exercise: 
Problem: 


Consider an infinite square well with wall boundaries = 0 and « = L. Show that the function 
(a) = Asin kz is the solution to the stationary SchrOdinger equation for the particle in a box only if 


k=V2mE /h. Explain why this is an acceptable wave function only if k is an integer multiple of 7/L. 
Solution: 


proof 
Exercise: 

Problem: 

Consider an infinite square well with wall boundaries « = 0 and « = L. Explain why the function 

(a) = Acos kz is not a solution to the stationary Schrédinger equation for the particle in a box. 
Exercise: 

Problem: 

Atoms in a crystal lattice vibrate in simple harmonic motion. Assuming a lattice atom has a mass of 


9.4 x 10~*° kg, what is the force constant of the lattice if a lattice atom makes a transition from the ground 
state to first excited state when it absorbs a 525-ym photon? 


Solution: 


1.2 N/m 
Exercise: 
Problem: 
A diatomic molecule behaves like a quantum harmonic oscillator with the force constant 12.0 N/m and mass 
5.60 x 10-26 kg. (a) What is the wavelength of the emitted photon when the molecule makes the transition 


from the third excited state to the second excited state? (b) Find the ground state energy of vibrations for this 
diatomic molecule. 


Exercise: 
Problem: 


An electron with kinetic energy 2.0 MeV encounters a potential energy barrier of height 16.0 MeV and width 
2.00 nm. What is the probability that the electron emerges on the other side of the barrier? 


Solution: 


0 
Exercise: 
Problem: 


A beam of mono-energetic protons with energy 2.0 MeV falls on a potential energy barrier of height 20.0 
MeV and of width 1.5 fm. What percentage of the beam is transmitted through the barrier? 


Challenge Problems 


Exercise: 
Problem: 
An electron in a long, organic molecule used in a dye laser behaves approximately like a quantum particle in a 


box with width 4.18 nm. Find the emitted photon when the electron makes a transition from the first excited 
state to the ground state and from the second excited state to the first excited state. 


Solution: 


19.2 um; 11.5 um 

Exercise: 
Problem: 
In STM, an elevation of the tip above the surface being scanned can be determined with a great precision, 
because the tunneling-electron current between surface atoms and the atoms of the tip is extremely sensitive 
to the variation of the separation gap between them from point to point along the surface. Assuming that the 
tunneling-electron current is in direct proportion to the tunneling probability and that the tunneling probability 
is to a good approximation expressed by the exponential function e~?°” with 6 = 10.0 /nm, determine the 


ratio of the tunneling current when the tip is 0.500 nm above the surface to the current when the tip is 0.515 
nm above the surface. 


Exercise: 
Problem: 
If STM is to detect surface features with local heights of about 0.00200 nm, what percent change in 


tunneling-electron current must the STM electronics be able to detect? Assume that the tunneling-electron 
current has characteristics given in the preceding problem. 


Solution: 


3.92% 
Exercise: 
Problem: 


Use Heisenberg’s uncertainty principle to estimate the ground state energy of a particle oscillating on an 
spring with angular frequency, w = of k/m, where k is the spring constant and m is the mass. 


Exercise: 


Problem: 


Suppose an infinite square well extends from —L/2 to +L/2. Solve the time-independent Schr6dinger’s 
equation to find the allowed energies and stationary states of a particle with mass m that is confined to this 
well. Then show that these solutions can be obtained by making the coordinate transformation z/ = x — L/2 
for the solutions obtained for the well extending between 0 and L. 


Solution: 


proof 
Exercise: 


Problem: 


A particle of mass m confined to a box of width L is in its first excited state ¢2(z). (a) Find its average 
position (which is the expectation value of the position). (b) Where is the particle most likely to be found? 


Glossary 


field emission 
electron emission from conductor surfaces when a strong external electric field is applied in normal direction 
to conductor’s surface 


nanotechnology 
technology that is based on manipulation of nanostructures such as molecules or individual atoms to produce 
nano-devices such as integrated circuits 


potential barrier 
potential function that rises and falls with increasing values of position 


quantum dot 
small region of a semiconductor nanocrystal embedded in another semiconductor nanocrystal, acting as a 
potential well for electrons 


quantum tunneling 
phenomenon where particles penetrate through a potential energy barrier with a height greater than the total 
energy of the particles 


resonant tunneling 
tunneling of electrons through a finite-height potential well that occurs only when electron energies match an 
energy level in the well, occurs in quantum dots 


resonant-tunneling diode 
quantum dot with an applied voltage bias across it 


scanning tunneling microscope (STM) 
device that utilizes quantum-tunneling phenomenon at metallic surfaces to obtain images of nanoscale 
structures 


transmission probability 
also called tunneling probability, the probability that a particle will tunnel through a potential barrier 


tunnel diode 
electron tunneling-junction between two different semiconductors 


tunneling probability 
also called transmission probability, the probability that a particle will tunnel through a potential barrier 


Introduction 
class="introduction" 


NGC1763 is 
an emission 
nebula in 
the Large 
Magellanic 
Cloud, 
which is a 
satellite 
galaxy to 
our Milky 
Way 
Galaxy. The 
colors we 
see can be 
explained 
by applying 
the ideas of 
quantum 
mechanics 
to atomic 
structure. 
(credit: 
modificatio 
n of work 
by NASA, 
ESA, and 
Josh Lake) 


In this chapter, we use quantum mechanics to study the structure and 
properties of atoms. This study introduces ideas and concepts that are 
necessary to understand more complex systems, such as molecules, crystals, 
and metals. As we deepen our understanding of atoms, we build on things 
we already know, such as Rutherford’s nuclear model of the atom, Bohr’s 
model of the hydrogen atom, and de Broglie’s wave hypothesis. 


[link] is NGC1763, an emission nebula in the small galaxy known as the 
Large Magellanic Cloud, which is a satellite of the Milky Way Galaxy. 
Ultraviolet light from hot stars ionizes the hydrogen atoms in the nebula. As 
protons and electrons recombine, radiation of different frequencies is 
emitted. The details of this process can be correctly predicted by quantum 
mechanics and are examined in this chapter. 


The Hydrogen Atom 
By the end of this section, you will be able to: 


e Describe the hydrogen atom in terms of wave function, probability 
density, total energy, and orbital angular momentum 

e Identify the physical significance of each of the quantum numbers ( 
n, l,m) of the hydrogen atom 

e Distinguish between the Bohr and Schrédinger models of the atom 

e Use quantum numbers to calculate important information about the 
hydrogen atom 


The hydrogen atom is the simplest atom in nature and, therefore, a good 
Starting point to study atoms and atomic structure. The hydrogen atom 
consists of a single negatively charged electron that moves about a positively 
charged proton ({link]). In Bohr’s model, the electron is pulled around the 
proton in a perfectly circular orbit by an attractive Coulomb force. The 
proton is approximately 1800 times more massive than the electron, so the 
proton moves very little in response to the force on the proton by the 
electron. (This is analogous to the Earth-Sun system, where the Sun moves 
very little in response to the force exerted on it by Earth.) An explanation of 
this effect using Newton’s laws is given in Photons and Matter Waves. 
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A representation of the Bohr model of the 
hydrogen atom. 


With the assumption of a fixed proton, we focus on the motion of the 
electron. 


In the electric field of the proton, the potential energy of the electron is 
Equation: 


2 
U(r) = i 


Lr 


where k = 1/479 and r is the distance between the electron and the proton. 
As we Saw earlier, the force on an object is equal to the negative of the 
gradient (or slope) of the potential energy function. For the special case of a 
hydrogen atom, the force between the electron and proton is an attractive 
Coulomb force. 


Notice that the potential energy function U(r) does not vary in time. As a 
result, Schrédinger’s equation of the hydrogen atom reduces to two simpler 
equations: one that depends only on space (x, y, z) and another that depends 
only on time (t). (The separation of a wave function into space- and time- 
dependent parts for time-independent potential energy functions is discussed 
in Quantum Mechanics.) We are most interested in the space-dependent 
equation: 

Equation: 


kh? (0*p Oy JAH e? 
—— + —_ + —_ ]-k—y=E 
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where w = w (a, y, Z) is the three-dimensional wave function of the 
electron, ™m, is the mass of the electron, and E is the total energy of the 
electron. Recall that the total wave function V (2, y, z,t), is the product of 
the space-dependent wave function 7 = (a, y, z) and the time-dependent 
wave function » = y(t). 


In addition to being time-independent, U(r) is also spherically symmetrical. 
This suggests that we may solve Schrédinger’s equation more easily if we 
express it in terms of the spherical coordinates (r, 8, ¢) instead of 


rectangular coordinates (x, y, z). A spherical coordinate system is shown in 
[link]. In spherical coordinates, the variable r is the radial coordinate, 0 is the 
polar angle (relative to the vertical z-axis), and ¢ is the azimuthal angle 
(relative to the x-axis). The relationship between spherical and rectangular 


coordinates isx = rsin@cos¢, y=rsin@sing, z=rcos#. 
z 


The relationship between the spherical and 
rectangular coordinate systems. 


The factor r sin 0 is the magnitude of a vector formed by the projection of 
the polar vector onto the xy-plane. Also, the coordinates of x and y are 
obtained by projecting this vector onto the x- and y-axes, respectively. The 
inverse transformation gives 

Equation: 


vA x 
r=Voer+y+22, 0=cos* (=), ¢ =cos* 
rf 
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Schrédinger’s wave equation for the hydrogen atom in spherical coordinates 
is discussed in more advanced courses in modern physics, so we do not 
consider it in detail here. However, due to the spherical symmetry of U(r), 
this equation reduces to three simpler equations: one for each of the three 
coordinates (r,@, and @). Solutions to the time-independent wave function 
are written as a product of three functions: 

Equation: 


P(r, 0,6) = R(r)O(8)8(4), 


where R is the radial function dependent on the radial coordinate r only; © is 
the polar function dependent on the polar coordinate @ only; and ® is the phi 
function of ¢ only. Valid solutions to Schrédinger’s equation ~(r, 6, @) are 
labeled by the quantum numbers n, |, and m. 


Equation: 
i: principal quantum number 
ie angular momentum quantum number 
m: angular momentum projection quantum number 


(The reasons for these names will be explained in the next section.) The 
radial function R depends only on n and J; the polar function © depends only 
on / and m; and the phi function ® depends only on m. The dependence of 
each function on quantum numbers is indicated with subscripts: 

Equation: 


Vrim(T, 9, 6) = Rr(r)Oim(9)Pm(¢). 


Not all sets of quantum numbers (n, |, m) are possible. For example, the 
orbital angular quantum number ! can never be greater or equal to the 
principal quantum number n (I < n). Specifically, we have 

Equation: 


n = 1,2,3,... 
0/152 and) 
i, = 14D 20). 


~~ 


Notice that for the ground state, n = 1, 1 = 0, and m = O. In other words, 
there is only one quantum state with the wave function for n = 1, and it is 
W100. However, for n = 2, we have 

Equation: 


l1=0, m=0 
l[=1, m=-1,0,1. 


Therefore, the allowed states for the n = 2 state are W200, W21_1, W210, and 
wo11. Example wave functions for the hydrogen atom are given in [link]. 
Note that some of these expressions contain the letter i, which represents 
4/—1. When probabilities are calculated, these complex numbers do not 
appear in the final answer. 
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Wave Functions of the Hydrogen Atom 


Physical Significance of the Quantum Numbers 


Each of the three quantum numbers of the hydrogen atom (n, I, m) is 
associated with a different physical quantity. The principal quantum 
number n is associated with the total energy of the electron, &,,. According 
to Schrédinger’s equation: 


Equation: 
204 
E, =- (=) (=) __ (=). 
22 n? n? 


where /y) = —13.6 eV. Notice that this expression is identical to that of 
Bohr’s model. As in the Bohr model, the electron in a particular state of 
energy does not radiate. 


Example: 

How Many Possible States? 

For the hydrogen atom, how many possible quantum states correspond to 
the principal number n = 3? What are the energies of these states? 
Strategy 

For a hydrogen atom of a given energy, the number of allowed states 
depends on its orbital angular momentum. We can count these states for 
each value of the principal quantum number, n = 1, 2, 3. However, the total 
energy depends on the principal quantum number only, which means that 
we can use [link] and the number of states counted. 


Solution 
If n = 3, the allowed values of | are 0, 1, and 2. If 1 = 0, m = 0 (1 state). If 
l= 1,m = —1,0,-+1 @ states); and if! = 2, m = —2, —-1,0,+1,4+2 (65 


states). In total, there are 1 + 3+ 5 = 9 allowed states. Because the total 
energy depends only on the principal quantum number, n = 3, the energy of 
each of these states is 

Equation: 


Eng = —Eo (=z) = ae = —1.5leV. 
Significance 

An electron in a hydrogen atom can occupy many different angular 
momentum states with the very same energy. As the orbital angular 
momentum increases, the number of the allowed states with the same 
energy increases. 


The angular momentum orbital quantum number | is associated with the 
orbital angular momentum of the electron in a hydrogen atom. Quantum 
theory tells us that when the hydrogen atom is in the state w,),,, the 
magnitude of its orbital angular momentum is 


Note: 
Equation: 
L=,/ll+ 1h, 
where 
Equation: 


1=0,1,2,...,(n—1). 


This result is slightly different from that found with Bohr’s theory, which 
quantizes angular momentum according to the rule 
£=n,wheren = 1,2, 3,.... 


Quantum states with different values of orbital angular momentum are 
distinguished using spectroscopic notation ([link]). The designations s, p, d, 


and f result from early historical attempts to classify atomic spectral lines. 
(The letters stand for sharp, principal, diffuse, and fundamental, 
respectively.) After f, the letters continue alphabetically. 


The ground state of hydrogen is designated as the 1s state, where “1” 
indicates the energy level (n = 1) and “s” indicates the orbital angular 
momentum state (1 = 0). When n = 2, ! can be either 0 or 1. The n = 2, 
[ = O state is designated “2s.” The n = 2, / = 1 state is designated “2p.” 
When n = 3, / can be 0, 1, or 2, and the states are 3s, 3p, and 3d, 
respectively. Notation for other quantum states is given in [link]. 


The angular momentum projection quantum number m is associated 
with the azimuthal angle ¢@ (see [link]) and is related to the z-component of 
orbital angular momentum of an electron in a hydrogen atom. This 
component is given by 


Note: 
Equation: 


L, = mh, 


where 
Equation: 


m= -l,-I+1,...,0,...,4+0-—1,l. 
The z-component of angular momentum is related to the magnitude of 


angular momentum by 
Equation: 


L,=Lcos9@, 


where @ is the angle between the angular momentum vector and the z-axis. 
Note that the direction of the z-axis is determined by experiment—that is, 
along any direction, the experimenter decides to measure the angular 
momentum. For example, the z-direction might correspond to the direction 


of an external magnetic field. The relationship between Land L is given in 
[ink]. 


The z-component of angular momentum is 
quantized with its own quantum number 
m. 


Orbital Quantum Angular Spectroscopic 


Number / Momentum State Name 

0 0 S Sharp 

1 /2h p Principal 

2 V/6h d Diffuse 

3 V12h f Fundamental 
4 V 20h g 

5 V30h h 


Spectroscopic Notation and Orbital Angular Momentum 


n=1 Is 

n=2 2s 2p 

n=3 3s 3p 3d 

n=4 As Ap Ad Af 

n= 5 5s 5p 5d of 5g 

n=6 6s 6p 6d 6f 6g 6h 


Spectroscopic Description of Quantum States 


The quantization of D, is equivalent to the quantization of #. Substituting 
,/U(1 + 1)h for L and m for L, into this equation, we find 


Equation: 
mh = 4/1(1+ 1)iicos 6. 


Thus, the angle 8 is quantized with the particular values 
Equation: 


6=cos + (=| . 
J1(i +1) 


Notice that both the polar angle (@) and the projection of the angular 
momentum vector onto an arbitrary z-axis (L,) are quantized. 


The quantization of the polar angle for the / = 3 state is shown in [link]. The 
orbital angular momentum vector lies somewhere on the surface of a cone 
with an opening angle @ relative to the z-axis (unless m = 0, in which case 
§ = 90° and the vector points are perpendicular to the z-axis). 


The quantization of orbital angular 
momentum. Each vector lies on the 
surface of a cone with axis along the 
Z-axis. 


A detailed study of angular momentum reveals that we cannot know all three 
components simultaneously. In the previous section, the z-component of 
orbital angular momentum has definite values that depend on the quantum 
number m. This implies that we cannot know both x- and y-components of 
angular momentum, L, and L,,, with certainty. As a result, the precise 


direction of the orbital angular momentum vector is unknown. 


Example: 
What Are the Allowed Directions? 


Calculate the angles that the angular momentum vector L can make with the 
z-axis for 1 = 1, as shown in [link]. 


The component of a given angular momentum 
along the z-axis (defined by the direction of a 
magnetic field) can have only certain values. 

These are shown here for / = 1, for which 


m = —1,0,and + 1. The direction of L is 
quantized in the sense that it can have only 
certain angles relative to the z-axis. 


Strategy 

The vectors L and L. (in the z-direction) form a right triangle, where Lis 
the hypotenuse and L, is the adjacent side. The ratio of L, to IL is the 
cosine of the angle of interest. The magnitudes L = IL and L, are given 
by 

Equation: 


L=,/l(l+1)hand L, = mh. 


Solution 

We are given ! = 1, so ml can be +1, 0, or — 1. Thus, L has the value 
given by 

Equation: 


L = /i(t+ 1h = 3K. 


The quantity D, can have three values, given by L, = mih. 
Equation: 


h,m,=+1 
i — a — 0, m, =0 
—h,m| — —] 


As you can see in [link], cos @ = L,/L, so for m = +1, we have 
Equation: 
L h 1 
cos 0; = a SS = 
L JR V2 


Thus, 
Equation: 


6, = cos 10.707 = 45.0°. 


Similarly, for m = 0, we find cos 02 = 0; this gives 


Equation: 

6, = cos 10 = 90.0°. 
Then form; = —1: 
Equation: 

L —h 1 

cos 03 = a ae —0.707, 

LC VR v2 
so that 
Equation: 

63 = cos *(—0.707) = 135.0’. 

Significance 


The angles are consistent with the figure. Only the angle relative to the z- 
axis is quantized. L can point in any direction as long as it makes the proper 
angle with the z-axis. Thus, the angular momentum vectors lie on cones, as 
illustrated. To see how the correspondence principle holds here, consider 
that the smallest angle (@; in the example) is for the maximum value of m,, 
namely m, = l. For that smallest angle, 

Equation: 


L, l 
COs — 


LE ian 


which approaches 1 as | becomes very large. If cos 8 = 1, then 6 = 0°. 
Furthermore, for large |, there are many values of ™,, so that all angles 
become possible as !| gets very large. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Can the magnitude of DL, ever be equal to 
ee 


Solution: 


No. The quantum number m = —l1,—[+1,...,0,...,/ — 1,2. Thus, 
the magnitude of L, is always less than L because < 4/1 (1+ 1) 


Using the Wave Function to Make Predictions 


As we Saw earlier, we can use quantum mechanics to make predictions about 
physical events by the use of probability statements. It is therefore proper to 
state, “An electron is located within this volume with this probability at this 
time,” but not, “An electron is located at the position (x, y, z) at this time.” 
To determine the probability of finding an electron in a hydrogen atom ina 
particular region of space, it is necessary to integrate the probability density 
\nim|? over that region: 

Equation: 


Probability = : |Pntm|2dV , 


volume 


where dV is an infinitesimal volume element. If this integral is computed for 
all space, the result is 1, because the probability of the particle to be located 
somewhere is 100% (the normalization condition). In a more advanced 


* 
course on modern physics, you will find that | Prim|? = Wi Waring Where 

* 
Wim is the complex conjugate. This eliminates the occurrences of 7 = ~ —1 
in the above calculation. 


Consider an electron in a state of zero angular momentum (/ = 0). In this 
case, the electron’s wave function depends only on the radial coordinate r. 


(Refer to the states 199 and 2009 in [link].) The infinitesimal volume 
element corresponds to a spherical shell of radius r and infinitesimal 
thickness dr, written as 

Equation: 


dV = 4rr?dr. 


The probability of finding the electron in the region r to r + dr (“at 
approximately r’”) is 


Note: 
Equation: 


P(r)dr = noo |?4ar7dr. 


Here P(r) is called the radial probability density function (a probability 
per unit length). For an electron in the ground state of hydrogen, the 
probability of finding an electron in the region r to r + dr is 

Equation: 


|noo|?4ar?dr = (4/a3)r? exp(—2r/ag)dr, 


where @g = 0.5 angstroms. The radial probability density function P(r) is 
plotted in [link]. The area under the curve between any two radial positions, 
Say 71 and rg, gives the probability of finding the electron in that radial 
range. To find the most probable radial position, we set the first derivative of 
this function to zero (€P/dr = 0) and solve for r. The most probable radial 
position is not equal to the average or expectation value of the radial position 
because |%n09|? is not symmetrical about its peak value. 


Most probable radial position 


a Ip 


The radial probability density function for the 
ground state of hydrogen. 


If the electron has orbital angular momentum (I 0), then the wave 
functions representing the electron depend on the angles 6 and ¢; that is, 
Wntm =Vnim (1, 8, d). Atomic orbitals for three states with n = 2 andl = 1 
are shown in [link]. An atomic orbital is a region in space that encloses a 
certain percentage (usually 90%) of the electron probability. (Sometimes 
atomic orbitals are referred to as “clouds” of probability.) Notice that these 
distributions are pronounced in certain directions. This directionality is 
important to chemists when they analyze how atoms are bound together to 
form molecules. 


Zz Zz Zz 


(a) (b) (Cc) 


The probability density distributions for three states with n = 2 and 
! = 1. The distributions are directed along the (a) x-axis, (b) y-axis, 
and (c) z-axis. 


A slightly different representation of the wave function is given in [link]. In 
this case, light and dark regions indicate locations of relatively high and low 
probability, respectively. In contrast to the Bohr model of the hydrogen atom, 
the electron does not move around the proton nucleus in a well-defined path. 
Indeed, the uncertainty principle makes it impossible to know how the 
electron gets from one place to another. 
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Probability clouds for the electron in the ground state and several 
excited states of hydrogen. The probability of finding the electron is 
indicated by the shade of color; the lighter the coloring, the greater the 
chance of finding the electron. 


Summary 


e A hydrogen atom can be described in terms of its wave function, 
probability density, total energy, and orbital angular momentum. 

e The state of an electron in a hydrogen atom is specified by its quantum 
numbers (n, |, m). 

e In contrast to the Bohr model of the atom, the Schrédinger model 
makes predictions based on probability statements. 

e The quantum numbers of a hydrogen atom can be used to calculate 
important information about the atom. 


Conceptual Questions 


Exercise: 
Problem: 


Identify the physical significance of each of the quantum numbers of 
the hydrogen atom. 


Solution: 


n (principal quantum number) — total energy 

l (orbital angular quantum number) — total absolute magnitude of the 
orbital angular momentum 

m (orbital angular projection quantum number) — z-component of the 
orbital angular momentum 


Exercise: 
Problem: 
Describe the ground state of hydrogen in terms of wave function, 
probability density, and atomic orbitals. 
Exercise: 
Problem: 
Distinguish between Bohr’s and Schrédinger’s model of the hydrogen 


atom. In particular, compare the energy and orbital angular momentum 
of the ground states. 


Solution: 


The Bohr model describes the electron as a particle that moves around 
the proton in well-defined orbits. Schrédinger’s model describes the 
electron as a wave, and knowledge about the position of the electron is 
restricted to probability statements. The total energy of the electron in 
the ground state (and all excited states) is the same for both models. 
However, the orbital angular momentum of the ground state is different 
for these models. In Bohr’s model, L (ground state) = 1, and in 
Schrédinger’s model, L (ground state) = 0. 


Problems 


Exercise: 


Problem: 


The wave function is evaluated at rectangular coordinates (x, y, z) = (2, 
1, 1) in arbitrary units. What are the spherical coordinates of this 
position? 


Solution: 
(r,0,$) = (/6,66", 27°). 


Exercise: 


Problem: 


If an atom has an electron in the n = 5 state with m = 3, what are the 
possible values of [? 


Exercise: 


Problem: 
What are the possible values of m for an electron in the n = 4 state? 


Solution: 


+3, +2, +1, 0 are possible 
Exercise: 


Problem: 


What, if any, constraints does a value of m = 1 place on the other 
quantum numbers for an electron in an atom? 


Exercise: 


Problem: How many possible states are there for the | = 4 state? 


Solution: 


18 
Exercise: 
Problem: 
(a) How many angles can L make with the z-axis for an / = 2 electron? 
(b) Calculate the value of the smallest angle. 
Exercise: 
Problem: 
The force on an electron is “negative the gradient of the potential 


energy function.” Use this knowledge and [link] to show that the force 
on the electron in a hydrogen atom is given by Coulomb’s force law. 


Solution: 
F=-k% 


Exercise: 


Problem: 


What is the total number of states with orbital angular momentum / = 0 
? (Ignore electron spin.) 


Exercise: 


Problem: 


The wave function is evaluated at spherical coordinates 

(7,0,;0)= (v3, 45°, 45°) , where the value of the radial coordinate is 
given in arbitrary units. What are the rectangular coordinates of this 
position? 


Solution: 


(151.1) 
Exercise: 


Problem: 


Coulomb’s force law states that the force between two charged particles 
is: 


r= kb. Use this expression to determine the potential energy 
function. 


Exercise: 


Problem: 


Write an expression for the total number of states with orbital angular 
momentum !. 


Solution: 


For the orbital angular momentum quantum number, |, the allowed 
values of: 

m= -—l,—-[+1.,...0,...0—1, 1. 

With the exception of m = 0, the total number is just 2/ because the 
number of states on either side of m = 0 is just /. Including m = 0, the 
total number of orbital angular momentum states for the orbital angular 
momentum quantum number, I, is: 2/ + 1. Later, when we consider 
electron spin, the total number of angular momentum states will be 


found to twice this value because each orbital angular momentum states 
is associated with two states of electron spin: spin up and spin down). 


Exercise: 
Problem: 
Consider hydrogen in the ground state, 7199. (a) Use the derivative to 


determine the radial position for which the probability density, P(r), is a 
maximum. 


(b) Use the integral concept to determine the average radial position. 
(This is called the expectation value of the electron’s radial position.) 
Express your answers into terms of the Bohr radius, ag. Hint: The 
expectation value is the just average value. (c) Why are these values 
different? 


Exercise: 
Problem: 


What is the probability that the 1s electron of a hydrogen atom is found 
outside the Bohr radius? 


Solution: 


The probability that the 1s electron of a hydrogen atom is found outside 
of the Bohr radius is / P(r)dr = 0.68 


ao 
Exercise: 


Problem: 


How many polar angles are possible for an electron in the / = 5 state? 
Exercise: 
Problem: 


What is the maximum number of orbital angular momentum electron 
states in the n = 2 shell of a hydrogen atom? (Ignore electron spin.) 


Solution: 


For n = 2,1 = 0 (1 state), and / = 1 (3 states). The total is 4. 
Exercise: 


Problem: 


What is the maximum number of orbital angular momentum electron 
states in the n = 3 shell of a hydrogen atom? (Ignore electron spin.) 


Glossary 


angular momentum orbital quantum number (1) 
quantum number associated with the orbital angular momentum of an 
electron in a hydrogen atom 


angular momentum projection quantum number (m) 
quantum number associated with the z-component of the orbital angular 
momentum of an electron in a hydrogen atom 


atomic orbital 
region in space that encloses a certain percentage (usually 90%) of the 
electron probability 


principal quantum number (n) 
quantum number associated with the total energy of an electron in a 
hydrogen atom 


radial probability density function 
function use to determine the probability of a electron to be found in a 
spatial interval in r 


Orbital Magnetic Dipole Moment of the Electron 
By the end of this section, you will be able to: 


e Explain why the hydrogen atom has magnetic properties 

e Explain why the energy levels of a hydrogen atom associated with 
orbital angular momentum are split by an external magnetic field 

e Use quantum numbers to calculate the magnitude and direction of the 
orbital magnetic dipole moment of a hydrogen atom 


In Bohr’s model of the hydrogen atom, the electron moves in a circular 
orbit around the proton. The electron passes by a particular point on the 
loop in a certain time, so we can calculate a current J = Q/t. An electron 
that orbits a proton in a hydrogen atom is therefore analogous to current 
flowing through a circular wire ([link]). In the study of magnetism, we saw 
that a current-carrying wire produces magnetic fields. It is therefore 
reasonable to conclude that the hydrogen atom produces a magnetic field 
and interacts with other magnetic fields. 
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(a) Current-carrying loop (b) Hydrogen atom 


(a) Current flowing through a circular wire is analogous to (b) an 
electron that orbits a proton in a hydrogen atom. 


The orbital magnetic dipole moment is a measure of the strength of the 
magnetic field produced by the orbital angular momentum of an electron. 
From Force and Torque on a Current Loop, the magnitude of the orbital 
magnetic dipole moment for a current loop is 

Equation: 


w= TIA, 


where IJ is the current and A is the area of the loop. (For brevity, we refer to 
this as the magnetic moment.) The current J associated with an electron in 
orbit about a proton in a hydrogen atom is 

Equation: 


Pe. 
T 


where e is the magnitude of the electron charge and T is its orbital period. If 
we assume that the electron travels in a perfectly circular orbit, the orbital 
period is 

Equation: 


where r is the radius of the orbit and v is the speed of the electron in its 
orbit. Given that the area of a circle is 7r?, the absolute magnetic moment 
is 

Equation: 


p= IA= ——a_7r* = — 


It is helpful to express the magnetic momentum yp in terms of the orbital 


> 


angular momentum (L =r x p). Because the electron orbits in a circle, 
the position vector r and the momentum vector p form a right angle. Thus, 
the magnitude of the orbital angular momentum is 

Equation: 


L= | = |f x p| =rpsind = rp=rmv. 


Combining these two equations, we have 


Equation: 
e 
= L 
i ( 2M- ) 


In full vector form, this expression is written as 


Note: 
Equation: 


The negative sign appears because the electron has a negative charge. 
Notice that the direction of the magnetic moment of the electron is 
antiparallel to the orbital angular momentum, as shown in [link](b). In the 


Bohr model of the atom, the relationship between pz and Lin [link] is 
independent of the radius of the orbit. 


The magnetic moment p can also be expressed in terms of the orbital 
angular quantum number /. Combining [link] and [link], the magnitude of 
the magnetic moment is 

Equation: 


i (s-)e= (= ) yt 2) = pm tC 2), 


e 


The z-component of the magnetic moment is 
Equation: 


e€ e€ 
2 Le A= — : 
. (s—) (sm cds 


The quantity zp is a fundamental unit of magnetism called the Bohr 
magneton, which has the value 9.3 x 10~*joule/tesla (J/T) or 

5.8 x 10 °eV/T. Quantization of the magnetic moment is the result of 
quantization of the orbital angular momentum. 


As we will see in the next section, the total magnetic dipole moment of the 
hydrogen atom is due to both the orbital motion of the electron and its 
intrinsic spin. For now, we ignore the effect of electron spin. 


Example: 

Orbital Magnetic Dipole Moment 

What is the magnitude of the orbital dipole magnetic moment jz of an 
electron in the hydrogen atom in the (a) s state, (b) p state, and (c) d state? 
(Assume that the spin of the electron is zero.) 

Strategy 

The magnetic momentum of the electron is related to its orbital angular 
momentum L. For the hydrogen atom, this quantity is related to the orbital 
angular quantum number I. The states are given in spectroscopic notation, 
which relates a letter (s, p, d, etc.) to a quantum number. 

Solution 

The magnitude of the magnetic moment is given in [link]: 

Equation: 


fe (z= (se) Vid +a) = wnt), 


a. For the s state, 1 = 0 so we have pw = O and p, = 0. 
b. For the p state, ] = 1 and we have 
Equation: 


= pp/1(1+1) = V2pp 
[iz = —LBpmM, where m = (—1,0,1),so 
Lz = bp, 0, —pe. 


c. For the d state, 1 = 2 and we obtain 
Equation: 


Jb = ppr/2(2+1) = Vou 
[iz = —LBmM, where m = (—2,—1,0,1,2), so 
Lz = 2up, MB, 0, —UB, —2pB. 


Significance 

In the s state, there is no orbital angular momentum and therefore no 
magnetic moment. This does not mean that the electron is at rest, just that 
the overall motion of the electron does not produce a magnetic field. In the 
p State, the electron has a magnetic moment with three possible values for 
the z-component of this magnetic moment; this means that magnetic 
moment can point in three different polar directions—each antiparallel to 
the orbital angular momentum vector. In the d state, the electron has a 
magnetic moment with five possible values for the z-component of this 
magnetic moment. In this case, the magnetic moment can point in five 
different polar directions. 


A hydrogen atom has a magnetic field, so we expect the hydrogen atom to 
interact with an external magnetic field—such as the push and pull between 
two bar magnets. From Force and Torque on a Current Loop, we know that 


when a current loop interacts with an external magnetic field B, it 


experiences a torque given by 
Equation: 


where IJ is the current, A is the area of the loop, # is the magnetic moment, 


and B is the external magnetic field. This torque acts to rotate the magnetic 
moment vector of the hydrogen atom to align with the external magnetic 
field. Because mechanical work is done by the external magnetic field on 
the hydrogen atom, we can talk about energy transformations in the atom. 
The potential energy of the hydrogen atom associated with this magnetic 
interaction is given by [link]: 

Equation: 


U=-ji-B. 


If the magnetic moment is antiparallel to the external magnetic field, the 
potential energy is large, but if the magnetic moment is parallel to the field, 
the potential energy is small. Work done on the hydrogen atom to rotate the 
atom’s magnetic moment vector in the direction of the external magnetic 
field is therefore associated with a drop in potential energy. The energy of 
the system is conserved, however, because a drop in potential energy 
produces radiation (the emission of a photon). These energy transitions are 
quantized because the magnetic moment can point in only certain 
directions. 


If the external magnetic field points in the positive z-direction, the potential 
energy associated with the orbital magnetic dipole moment is 


Note: 
Equation: 


U (6) = —pB cos 0 = —p,B = — (—ppm)B = mppB, 


where jg is the Bohr magneton and m is the angular momentum projection 
quantum number (or magnetic orbital quantum number), which has the 


values 
Equation: 


m=-—I,—1+1,...,0,..,2—11. 


For example, in the / = 1 electron state, the total energy of the electron is 
split into three distinct energy levels corresponding to U = —ypB, 0, upB. 


The splitting of energy levels by an external magnetic field is called the 
Zeeman effect. Ignoring the effects of electron spin, transitions from the 

| = 1 state to a common lower energy state produce three closely spaced 
spectral lines ((link], left column). Likewise, transitions from the / = 2 state 
produce five closely spaced spectral lines (right column). The separation of 
these lines is proportional to the strength of the external magnetic field. 
This effect has many applications. For example, the splitting of lines in the 
hydrogen spectrum of the Sun is used to determine the strength of the Sun’s 
magnetic field. Many such magnetic field measurements can be used to 
make a map of the magnetic activity at the Sun’s surface called a 
magnetogram ((link]). 

(/= 1) > (/ = 0) (| = 2) > (= 0) 


,=0 


ext 


Large B,., 


The Zeeman effect refers to the splitting of spectral 
lines by an external magnetic field. In the left column, 
the energy splitting occurs due to transitions from the 
state (n = 2,1 = 1) to a lower energy state; and in the 
right column, energy splitting occurs due to transitions 
from the state (n = 2,1 = 2) to a lower-energy state. 

The separation of these lines is proportional to the 

strength of the external magnetic field. 


A magnetogram of the Sun. The bright and dark spots 
show significant magnetic activity at the surface of the 
Sun. (credit: NASA, SDO) 


Summary 


e A hydrogen atom has magnetic properties because the motion of the 
electron acts as a current loop. 

e The energy levels of a hydrogen atom associated with orbital angular 
momentum are split by an external magnetic field because the orbital 
angular magnetic moment interacts with the field. 

e The quantum numbers of an electron in a hydrogen atom can be used 
to calculate the magnitude and direction of the orbital magnetic dipole 
moment of the atom. 


Conceptual Questions 


Exercise: 
Problem: 
Explain why spectral lines of the hydrogen atom are split by an 


external magnetic field. What determines the number and spacing of 
these lines? 


Exercise: 
Problem: 
A hydrogen atom is placed in a magnetic field. Which of the following 


quantities are affected? (a) total energy; (b) angular momentum; (c) z- 
component of angular momentum; (d) polar angle. 


Solution: 


a, c, d; The total energy is changed (Zeeman splitting). The work done 
on the hydrogen atom rotates the atom, so the z-component of angular 
momentum and polar angle are affected. However, the angular 
momentum is not affected. 


Exercise: 


Problem: 


On what factors does the orbital magnetic dipole moment of an 
electron depend? 


Problems 


Exercise: 
Problem: 


Find the magnitude of the orbital magnetic dipole moment of the 
electron in in the 3p state. (Express your answer in terms of pp.) 


Solution: 


The 3p state corresponds to n = 3, | = 2. Therefore, 4 = up V6 
Exercise: 

Problem: 

A current of J = 2A flows through a square-shaped wire with 2-cm 

side lengths. What is the magnetic moment of the wire? 
Exercise: 

Problem: 

Estimate the ratio of the electron magnetic moment to the muon 


magnetic moment for the same state of orbital angular momentum. 
(Hint: m, = 105.7 MeV/c?) 


Solution: 


The ratio of their masses is 1/207, so the ratio of their magnetic 
moments is 207. The electron’s magnetic moment is more than 200 
times larger than the muon. 


Exercise: 


Problem: 

Find the magnitude of the orbital magnetic dipole moment of the 

electron in in the 4d state. (Express your answer in terms of pp.) 
Exercise: 

Problem: 

For a 3d electron in an external magnetic field of 2.50 x 10~° T, find 


(a) the current associated with the orbital angular momentum, and (b) 
the maximum torque. 


Solution: 


a. The 3d state corresponds to n = 3, 1 = 2. So, 
T4438 5<:10- "Ay 
b. The maximum torque occurs when the magnetic moment and 
external magnetic field vectors are at right angles (sin 9 = 1). In this 
case: 
alley 1252 
7 = 5.70 x 10° N-m. 
Exercise: 


Problem: 
An electron in a hydrogen atom is in the n = 5, 1 = 4 state. Find the 


smallest angle the magnetic moment makes with the z-axis. (Express 
your answer in terms of zp.) 


Exercise: 
Problem: 


Find the minimum torque magnitude \7 | that acts on the orbital 


magnetic dipole of a 3p electron in an external magnetic field of 
25010" i 


Solution: 


A 3p electron is in the state n = 3 and! = 1. The minimum torque 
magnitude occurs when the magnetic moment and external magnetic 
field vectors are most parallel (antiparallel). This occurs when 

m = +1.The torque magnitude is given by 

[7| = wBsin 8, 

Where 

p31 % 10/7). 

For m = +1, we have: 

ra 232 x 10 Na 


Exercise: 


Problem: 


An electron in a hydrogen atom is in 3p state. Find the smallest angle 
the magnetic moment makes with the z-axis. (Express your answer in 
terms of jp.) 


Exercise: 


Problem: Show that U = — p- B. 


(Hint: An infinitesimal amount of work is done to align the magnetic 
moment with the external field. This work rotates the magnetic 
moment vector through an angle —dé@ (toward the positive z-direction), 
where dé is a positive angle change.) 


Solution: 


An infinitesimal work dW done by a magnetic torque 7 to rotate the 
magnetic moment through an angle —dé: 

dW = 7 (—d6), 

where T = 7 x B ; Work done is interpreted as a drop in potential 


energy U, so 

dW = —dvu. 

The total energy change is determined by summing over infinitesimal 
changes in the potential energy: 


U = —pBcos 0 
U=-2-B. 


Glossary 


Bohr magneton 
magnetic moment of an electron, equal to 9.3 x 10-745 /Tor 
5.8 x 10 °eV/T 


magnetic orbital quantum number 
another term for the angular momentum projection quantum number 


magnetogram 
pictoral representation, or map, of the magnetic activity at the Sun’s 
surface 


orbital magnetic dipole moment 
measure of the strength of the magnetic field produced by the orbital 
angular momentum of the electron 


Zeeman effect 
splitting of energy levels by an external magnetic field 


Electron Spin 
By the end of this section, you will be able to: 


e Express the state of an electron in a hydrogen atom in terms of five 
quantum numbers 

e Use quantum numbers to calculate the magnitude and direction of the 
spin and magnetic moment of an electron 

e Explain the fine and hyperfine structure of the hydrogen spectrum in 
terms of magnetic interactions inside the hydrogen atom 


In this section, we consider the effects of electron spin. Spin introduces two 
additional quantum numbers to our model of the hydrogen atom. Both were 
discovered by looking at the fine structure of atomic spectra. Spin is a 
fundamental characteristic of all particles, not just electrons, and is analogous 
to the intrinsic spin of extended bodies about their own axes, such as the 
daily rotation of Earth. 


Spin is quantized in the same manner as orbital angular momentum. It has 
been found that the magnitude of the intrinsic spin angular momentum S of 
an electron is given by 


Note: 
Equation: 


§ = y/s(s+ Ih, 


where s is defined to be the spin quantum number. This is similar to the 
quantization of L given in [link], except that the only value allowed for s for 
an electron is s = 1/2. The electron is said to be a “spin-half particle.” The 
spin projection quantum number m™, is associated with the z-components 
of spin, expressed by 


Note: 
Equation: 


In general, the allowed quantum numbers are 
Equation: 


m, = —8s,—s+1,...,0,...,+8—- 


For the special case of an electron (s = 1/2), 
Equation: 


1 1 


Ms = 9? 2° 


1, s. 


Directions of intrinsic spin are quantized, just as they were for orbital angular 
momentum. The m, = —1/2 state is called the “spin-down” state and has a 
z-component of spin, s, = —1/2; them, = +1/2 state is called the “spin- 
up” state and has a z-component of spin, s, = +1/2. These states are shown 


in [link]. 


ee ee S (spin up) 


D enn n nn ------5 S (spin down) 


The two possible states of electron spin. 


The intrinsic magnetic dipole moment of an electron fz. can also be 
expressed in terms of the spin quantum number. In analogy to the orbital 
angular momentum, the magnitude of the electron magnetic moment is 


Equation: 
e€ 
= S. 
a ( 2Me ) 


According to the special theory of relativity, this value is low by a factor of 2. 
Thus, in vector form, the spin magnetic moment is 


Note: 
Equation: 


The z-component of the magnetic moment is 


Equation: 
e€ e€ 
z=—(|— |S,=-— | — ]m, fh. 
7 ( Ne ( Me )m 


The spin projection quantum number has just two values (m, = +1/2), so 
the z-component of the magnetic moment also has just two values: 
Equation: 


p= (5% ) = uot, 


where jp is one Bohr magneton. An electron is magnetic, so we expect the 
electron to interact with other magnetic fields. We consider two special cases: 
the interaction of a free electron with an external (nonuniform) magnetic 
field, and an electron in a hydrogen atom with a magnetic field produced by 
the orbital angular momentum of the electron. 


Example: 

Electron Spin and Radiation 

A hydrogen atom in the ground state is placed in an external uniform 
magnetic field (B = 1.5 T). Determine the frequency of radiation produced 
in a transition between the spin-up and spin-down states of the electron. 
Strategy 


The spin projection quantum number is m, = +1/2, so the z-component of 
the magnetic moment is 
Equation: 


e 
pea (5* ) = Snot. 
e 


The potential energy associated with the interaction between the electron 
magnetic moment and the external magnetic field is 
Equation: 


U = —p,B = FppB. 


The frequency of light emitted is proportional to the energy (A F) difference 
between these two states. 

Solution 

The energy difference between these states is AE = 2upB, so the 
frequency of radiation produced is 


Equation: 
10> eV 
h h 4.136 x 10 PeV-s s 
Significance 


The electron magnetic moment couples with the external magnetic field. 
The energy of this system is different whether the electron is aligned or not 
with the proton. The frequency of radiation produced by a transition 
between these states is proportional to the energy difference. If we double 
the strength of the magnetic field, holding all other things constant, the 
frequency of the radiation doubles and its wavelength is cut in half. 


In a hydrogen atom, the electron magnetic moment can interact with the 
magnetic field produced by the orbital angular momentum of the electron, a 


phenomenon called spin-orbit coupling. The orbital angular momentum (L), 


orbital magnetic moment (j4), spin angular momentum (.S), and spin 
magnetic moment (7 ,) vectors are shown together in [link]. 


Just as the energy levels of a hydrogen atom can be split by an external 
magnetic field, so too are the energy levels of a hydrogen atom split by 
internal magnetic fields of the atom. If the magnetic moment of the electron 
and orbital magnetic moment of the electron are antiparallel, the potential 
energy from the magnetic interaction is relatively high, but when these 
moments are parallel, the potential energy is relatively small. Transition from 
each of these two states to a lower-energy level results in the emission of a 
photon of slightly different frequency. That is, the spin-orbit coupling “splits 
the spectral line expected from a spin-less electron. The fine structure of the 
hydrogen spectrum is explained by spin-orbit coupling. 


9 


Spin-orbit coupling is the 
interaction of an electron’s spin 
magnetic moment /, with its 
orbital magnetic moment [Z;. 


The Stern-Gerlach experiment provides experimental evidence that electrons 
have spin angular momentum. The experiment passes a stream of silver (Ag) 
atoms through an external, nonuniform magnetic field. The Ag atom has an 

orbital angular momentum of zero and contains a single unpaired electron in 


the outer shell. Therefore, the total angular momentum of the Ag atom is due 
entirely to the spin of the outer electron (s = 1/2). Due to electron spin, the 
Ag atoms act as tiny magnets as they pass through the magnetic field. These 
“magnets” have two possible orientations, which correspond to the spin-up 
and -down states of the electron. The magnetic field diverts the spin up atoms 
in one direction and the spin-down atoms in another direction. This produces 
two distinct bands on a screen ((link]). 


Beam 
collimator 
Oven 
“a, 
Beam of 
silver atoms 


Nonuniform 
magnetic field 


Photographic 
plate 


In the Stern-Gerlach experiment, an external, nonuniform magnetic 
field diverts a beam of electrons in two different directions. This 
result is due to the quantization of spin angular momentum. 


According to classical predictions, the angular momentum (and, therefore, 
the magnetic moment) of the Ag atom can point in any direction, so one 
expects, instead, a continuous smudge on the screen. The resulting two bands 
of the Stern-Gerlach experiment provide startling support for the ideas of 
quantum mechanics. 


Note: 
Visit PhET Explorations: Stern-Gerlach Experiment to learn more about the 
Stern-Gerlach experiment. 


Note: 
Exercise: 


Problem: 


Check Your Understanding If the Stern-Gerlach experiment yielded 
four distinct bands instead of two, what might be concluded about the 
spin quantum number of the charged particle? 


Solution: 


$= 3/2 < 


Just like an electron, a proton is spin 1/2 and has a magnetic moment. 
(According to nuclear theory, this moment is due to the orbital motion of 
quarks within the proton.) The hyperfine structure of the hydrogen 
spectrum is explained by the interaction between the magnetic moment of the 
proton and the magnetic moment of the electron, an interaction known as 
spin-spin coupling. The energy of the electron-proton system is different 
depending on whether or not the moments are aligned. Transitions between 
these states (spin-flip transitions) result in the emission of a photon with a 
wavelength of A ~ 21 cm (in the radio range). The 21-cm line in atomic 
spectroscopy is a “fingerprint” of hydrogen gas. Astronomers exploit this 
spectral line to map the spiral arms of galaxies, which are composed mostly 
of hydrogen ([link]). 


(a) (b) (Cc) 


The magnetic interaction between the electron and proton in the 
hydrogen atom is used to map the spiral arms of the Pinwheel Galaxy 
(NGC 5457). (a) The galaxy seen in visible light; (b) the galaxy seen in 
21-cm hydrogen radiation; (c) the composite image of (a) and (b). 
Notice how the hydrogen emission penetrates dust in the galaxy to show 
the spiral arms very clearly, whereas the galactic nucleus shows up 
better in visible light (credit a: modification of work by ESA & NASA; 
credit b: modification of work by Fabian Walter). 


A complete specification of the state of an electron in a hydrogen atom 
requires five quantum numbers: n, |, m, s, and m,. The names, symbols, and 
allowed values of these quantum numbers are summarized in [link]. 


Name Symbol Allowed values 
Principal quantum number n ae a ee 
Angular momentum l 0,1,2,...n—1 


Angular momentum projection m O22 ed 


Name Symbol Allowed values 

Spin S 1/2 (electrons) 

Spin projection om me ee. 
Summary of Quantum Numbers of an Electron in a Hydrogen Atom 


Note that the intrinsic quantum numbers introduced in this section (s and m,) 
are valid for many particles, not just electrons. For example, quarks within an 
atomic nucleus are also spin-half particles. As we will see later, quantum 
numbers help to classify subatomic particles and enter into scientific models 
that attempt to explain how the universe works. 


Summary 


e The state of an electron in a hydrogen atom can be expressed in terms of 
five quantum numbers. 

e The spin angular momentum quantum of an electron is = +'%. The spin 
angular momentum projection quantum number is m, = +“%or — % 
(spin up or spin down). 

e The fine and hyperfine structures of the hydrogen spectrum are 
explained by magnetic interactions within the atom. 


Conceptual Questions 


Exercise: 


Problem: 


Explain how a hydrogen atom in the ground state (J = 0) can interact 
magnetically with an external magnetic field. 


Solution: 


Even in the ground state (J = 0), a hydrogen atom has magnetic 
properties due the intrinsic (internal) electron spin. The magnetic 


moment of an electron is proportional to its spin. 
Exercise: 
Problem: 
Compare orbital angular momentum with spin angular momentum of an 
electron in the hydrogen atom. 
Exercise: 
Problem: 


List all the possible values of s and m, for an electron. Are there 
particles for which these values are different? 


Solution: 


For all electrons, s = % andm, = +. As we will see, not all particles 
have the same spin quantum number. For example, a photon as a spin 1 
(s = 1), and a Higgs boson has spin 0 (s = 0). 


Exercise: 


Problem: 


Are the angular momentum vectors L and S necessarily aligned? 
Exercise: 

Problem: What is spin-orbit coupling? 

Solution: 

An electron has a magnetic moment associated with its intrinsic 

(internal) spin. Spin-orbit coupling occurs when this interacts with the 


magnetic field produced by the orbital angular momentum of the 
electron. 


Problems 


Exercise: 
Problem: 
What is the magnitude of the spin momentum of an electron? (Express 
you answer in terms of h. ) 
Exercise: 
Problem: 


What are the possible polar orientations of the spin momentum vector 
for an electron? 


Solution: 


Spin up (relative to positive z-axis): 
C= 50 
Spin down (relative to positive z-axis): 


a 
§= cos! (¥) = cos! (+) = cos! (+) — 125°. 
> 


Exercise: 


Problem: 


For n = 1, write all the possible sets of quantum numbers (n, 1, m, m,). 
Exercise: 

Problem: 

A hydrogen atom is placed in an external uniform magnetic field ( 


B = 200 T). Calculate the wavelength of light produced in a transition 
from a spin up to spin down state. 


Solution: 


The spin projection quantum number is m, = --%, so the z-component 
of the magnetic moment is 

[iz = Spe. 

The potential energy associated with the interaction between the 


electron and the external magnetic field is 

U = Fup. 

The energy difference between these states is AE = 2p B, so the 
wavelength of light produced is 

A= 8.38 x 10° °mx 84um 


Exercise: 
Problem: 
If the magnetic field in the preceding problem is quadrupled, what 


happens to the wavelength of light produced in a transition from a spin 
up to spin down state? 


Exercise: 
Problem: 
If the magnetic moment in the preceding problem is doubled, what 


happens to the frequency of light produced in a transition from a spin-up 
to spin-down state? 


Solution: 


It is increased by a factor of 2. 
Exercise: 


Problem: 


For n = 2, write all the possible sets of quantum numbers (n, |, m, m5). 


Glossary 


fine structure 
detailed structure of atomic spectra produced by spin-orbit coupling 


hyperfine structure 
detailed structure of atomic spectra produced by spin-orbit coupling 


spin-flip transitions 


atomic transitions between states of an electron-proton system in which 
the magnetic moments are aligned and not aligned 


spin-orbit coupling 
interaction between the electron magnetic moment and the magnetic 
field produced by the orbital angular momentum of the electron 


spin projection quantum number (m,) 
quantum number associated with the z-component of the spin angular 
momentum of an electron 


spin quantum number (s) 
quantum number associated with the spin angular momentum of an 
electron 


The Exclusion Principle and the Periodic Table 
By the end of this section, you will be able to: 


e Explain the importance of Pauli’s exclusion principle to an 
understanding of atomic structure and molecular bonding 

e Explain the structure of the periodic table in terms of the total energy, 
orbital angular momentum, and spin of individual electrons in an atom 

e Describe the electron configuration of atoms in the periodic table 


So far, we have studied only hydrogen, the simplest chemical element. We 
have found that an electron in the hydrogen atom can be completely 
specified by five quantum numbers: 


Equation: 
n: principal quantum number 
Ls angular momentum quantum number 
mM: angular momentum projection quantum number 
8: spin quantum number 
Ms: spin projection quantum number 


To construct the ground state of a neutral multi-electron atom, imagine 
starting with a nucleus of charge Ze (that is, a nucleus of atomic number Z) 
and then adding Z electrons one by one. Assume that each electron moves 
in a spherically symmetrical electric field produced by the nucleus and all 
other electrons of the atom. The assumption is valid because the electrons 
are distributed randomly around the nucleus and produce an average 
electric field (and potential) that is spherically symmetrical. The electric 
potential U(r) for each electron does not follow the simple —1/r form 
because of interactions between electrons, but it turns out that we can still 
label each individual electron state by quantum numbers, (n,1,m, 8, ms). 
(The spin quantum number s is the same for all electrons, so it will not be 
used in this section.) 


The structure and chemical properties of atoms are explained in part by 
Pauli’s exclusion principle: No two electrons in an atom can have the 
same values for all four quantum numbers (n,/,m,m,). This principle is 


related to two properties of electrons: All electrons are identical (“when 
you’ve seen one electron, you’ve seen them all’) and they have half- 
integral spin (s = 1/2). Sample sets of quantum numbers for the electrons 
in an atom are given in [link]. Consistent with Pauli’s exclusion principle, 
no two rows of the table have the exact same set of quantum numbers. 


No. of No. of 

Subshell electrons: electrons: 
n | m Ms symbol subshell shell 
1 0 0 i 

1s 2 2 
1 0 0 VW 
2 0 0 % 

2S 2 
2 0 0 VW 
2 1 1 yy 

_1 

2 1 1 Vy 8 
2 1 O Y% 2p 6 
2 1 0 —-Y 
2 1 1 WY 


Subshell 
symbol 


38 


3p 


3d 


No. of 
electrons: 
subshell 


10 


No. of 
electrons: 
shell 


18 


No. of No. of 


Subshell electrons: electrons: 
n l m Ms symbol subshell shell 
3 2 0 VY 
3 2 1 % 
3 2 1 —Y 
3 2 2 % 
3 2 2 —Y 


Electron States of AtomsBecause of Pauli’s exclusion principle, no two 
electrons in an atom have the same set of four quantum numbers. 


Electrons with the same principal quantum number n are said to be in the 
same shell, and those that have the same value of | are said to occupy the 
same subshell. An electron in the n = 1 state of a hydrogen atom is denoted 
1s, where the first digit indicates the shell (n = 1) and the letter indicates 
the subshell (s, p, d, f... correspond to! = 0,1, 2,3...). Two electrons 
in the n = 1 state are denoted as 1s”, where the superscript indicates the 
number of electrons. An electron in the n = 2 state with 1 = 1 is denoted 
2p. The combination of two electrons in the n = 2 andl = O state, and 
three electrons in the n = 2 and / = 1 state is written as 2s?2p°*, and so on. 
This representation of the electron state is called the electron configuration 
of the atom. The electron configurations for several atoms are given in 
[link]. Electrons in the outer shell of an atom are called valence electrons. 
Chemical bonding between atoms in a molecule are explained by the 
transfer and sharing of valence electrons. 


Element 


H 


He 


Li 


Be 


Al 


Electron Configurations of Electrons in an AtomThe symbol ( 
an unpaired electron in the outer shell, whereas the symbol (T 


Electron Configuration 
1s! 

1s? 

1872s! 
1872s” 
1s72s72p! 
1872s72p* 
1s72s72p° 
1s72s72p* 
1s72s72p° 
1s72s72p° 
1s72s?2p°3s! 
1872s72p°3s” 


1872s72p°3s73p' 


Spin Alignment 
(T) 
(TL) 
(T) 
TD) 
(TL)(1) 
(TL) 
(THM) 
(THAT) 
DTG Lt) 
COC TCC) 
(T) 

(TL) 

LD) 


T) indicates 
|) indicates a 


pair of spin-up and -down electrons in an outer shell. 


The maximum number of electrons in a subshell depends on the value of 
the angular momentum quantum number, |. For a given a value /, there are 
21 + 1 orbital angular momentum states. However, each of these states can 


be filled by two electrons (spin up and down, tJ). Thus, the maximum 
number of electrons in a subshell is 


Note: 
Equation: 


N = 2(21+ 1) = 4142. 


In the 2s (J = 0) subshell, the maximum number of electrons is 2. In the 2p 
(J = 1) subshell, the maximum number of electrons is 6. Therefore, the 
total maximum number of electrons in the n = 2 shell (including both the 

! = 0 and 1 subshells) is 2 + 6 or 8. In general, the maximum number of 
electrons in the nth shell is 2n?. 


Example: 

Subshells and Totals for n = 3 

How many subshells are in the n = 3 shell? Identify each subshell and 
calculate the maximum number of electrons that will fill each. Show that 
the maximum number of electrons that fill an atom is 2n?. 

Strategy 

Subshells are determined by the value of 1; thus, we first determine which 
values of | are allowed, and then we apply the equation “maximum number 
of electrons that can be in a subshell = 2 (27 + 1)” to find the number of 
electrons in each subshell. 

Solution 

Because n = 3, we know that / can be 0, 1, or 2; thus, there are three 
possible subshells. In standard notation, they are labeled the 3s, 3p, and 3d 
subshells. We have already seen that two electrons can be in an s state, and 
six in a p state, but let us use the equation “maximum number of electrons 
that can be in a subshell = 2 (21 + 1)” to calculate the maximum number 
in each: 


Equation: 


3s has 1 = 0; thus, 2(21+ 1) = 2(0+- 1) =2 
3p has! = 1; thus, 2(21 + 1) = 2(2+1) =6 
3d has! = 2; thus, 2(21 + 1) = 2(4+ 1) = 10 
Total = 18 

(in the n = 3 shell). 


The equation “maximum number of electrons that can be in a shell = 2n?” 
gives the maximum number in the n = 3 shell to be 
Equation: 


Maximum number of electrons = 2n? = 2(3)” = 2(9) = 18. 


Significance 

The total number of electrons in the three possible subshells is thus the 
same as the formula 2n?. In standard (spectroscopic) notation, a filled 

n = 3 shell is denoted as 38?3p°3d!°. Shells do not fill in a simple manner. 
Before the n = 3 shell is completely filled, for example, we begin to find 
electrons in the n = 4 shell. 


The structure of the periodic table ([link]) can be understood in terms of 
shells and subshells, and, ultimately, the total energy, orbital angular 
momentum, and spin of the electrons in the atom. A detailed discussion of 
the periodic table is left to a chemistry course—we sketch only its basic 
features here. In this discussion, we assume that the atoms are electrically 
neutral; that is, they have the same number of electrons and protons. (Recall 
that the total number of protons in an atomic nucleus is called the atomic 
number, Z.) 


First, the periodic table is arranged into columns and rows. The table is read 
left to right and top to bottom in the order of increasing atomic number Z. 
Atoms that belong to the same column or chemical group share many of 
the same chemical properties. For example, the Li and Na atoms (in the first 


column) bond to other atoms in a similar way. The first row of the table 
corresponds to the 1s (J = 0) shell of an atom. 


Consider the hypothetical procedure of adding electrons, one by one, to an 
atom. For hydrogen (H) (upper left), the 1s shell is filled with either a spin 
up or down electron (f¢ or |). This lone electron is easily shared with other 
atoms, so hydrogen is chemically active. For helium (He) (upper right), the 
1s shell is filled with both a spin up and a spin down (ft) electron. This 
“fills” the 1s shell, so a helium atom tends not to share electrons with other 
atoms. The helium atom is said to be chemically inactive, inert, or noble; 
likewise, helium gas is said to be an inert gas or noble gas. 


Note: 


Build an atom by adding and subtracting protons, neutrons, and electrons. 
How does the element, charge, and mass change? Visit PhET Explorations: 
Build an Atom to explore the answers to these questions. 
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The periodic table of elements, showing the structure of shells and 
subshells. 


The second row corresponds to the 2s and 2p subshells. For lithium (Li) 
(upper left), the 1s shell is filled with a spin-up and spin-down electron (*) 
and the 2s shell is filled with either a spin-up or -down electron (f or |). Its 
electron configuration is therefore 1572s! or [He]2s, where [He] indicates a 
helium core. Like hydrogen, the lone electron in the outermost shell is 
easily shared with other atoms. For beryllium (Be), the 2s shell is filled with 
a Sy and -down electron (tJ), and has the electron configuration [He] 
28°. 


Next, we look at the right side of the table. For boron (B), the 1s and 2s 
shells are filled and the 2p (J = 1) shell contains either a spin up or down 
electron (+ or |). From carbon (C) to neon (N), we the fill the 2p shell. The 
maximum number of electrons in the 2p shells is 41 + 2 = 4(2) + 2 = 6. 
For neon (Ne), the 1s shell is filled with a spin-up and spin-down electron ( 
tJ), and the 2p shell is filled with six electrons (T | T | T |). This “fills” the 
1s, 2s, and 2p subshells, so like helium, the neon atom tends not to share 
electrons with other atoms. 


The process of electron filling repeats in the third row. However, beginning 
in the fourth row, the pattern is broken. The actual order of order of electron 
filling is given by 


1s, 2s, 2p, 3s, 3p, 4s, 3d, 4p, 5s, 4d, 5p, 6s, 4f, 5d, 6p, 7s.... 


Notice that the 3d, 4d, 4f, and 5d subshells (in bold) are filled out of order; 
this occurs because of interactions between electrons in the atom, which so 
far we have neglected. The transition metals are elements in the gap 
between the first two columns and the last six columns that contain 
electrons that fill the d (J = 1) subshell. As expected, these atoms are 
arranged in 41 + 2 = 4(2) + 2 = 10 columns. The structure of the periodic 
table can be understood in terms of the quantization of the total energy (n), 
orbital angular momentum (I), and spin (s). The first two columns 
correspond to the s (J = 0) subshell, the next six columns correspond to the 
p (l = 1) subshell, and the gap between these columns corresponds to the d 
(J = 2) subshell. 


The periodic table also gives information on molecular bonding. To see this, 
consider atoms in the left-most column (the so-called alkali metals 
including: Li, Na, and K). These atoms contain a single electron in the 2s 
subshell, which is easily donated to other atoms. In contrast, atoms in the 
second-to-right column (the halogens: for example, Cl, F, and Br) are 
relatively stingy in sharing electrons. These atoms would much rather 
accept an electron, because they are just one electron shy of a filled shell 
(“of being noble”). 


Therefore, if a Na atom is placed in close proximity to a Cl atom, the Na 
atom freely donates its 2s electron and the Cl atom eagerly accepts it. In the 


process, the Na atom (originally a neutral charge) becomes positively 
charged and the Cl (originally a neutral charge) becomes negatively 
charged. Charged atoms are called ions. In this case, the ions are Na* and 
Cl, where the superscript indicates charge of the ion. The electric 
(Coulomb) attraction between these atoms forms a NaCl (salt) molecule. A 
chemical bond between two ions is called an ionic bond. There are many 
kinds of chemical bonds. For example, in an oxygen molecule Og electrons 
are equally shared between the atoms. The bonding of oxygen atoms is an 
example of a covalent bond. 


Summary 


e Pauli’s exclusion principle states that no two electrons in an atom can 
have all the same quantum numbers. 

e The structure of the periodic table of elements can be explained in 
terms of the total energy, orbital angular momentum, and spin of 
electrons in an atom. 

e The state of an atom can be expressed by its electron configuration, 
which describes the shells and subshells that are filled in the atom. 


Conceptual Questions 


Exercise: 
Problem: 
What is Pauli’s exclusion principle? Explain the importance of this 


principle for the understanding of atomic structure and molecular 
bonding. 


Exercise: 


Problem: 


Compare the electron configurations of the elements in the same 
column of the periodic table. 


Solution: 


Elements that belong in the same column in the periodic table of 
elements have the same fillings of their outer shells, and therefore the 
same number of valence electrons. For example: 

Li: 1822s! (one valence electron in the n = 2 shell) 

Na: 1s72s2p°3s! (one valence electron in the n = 2 shell) 

Both, Li and Na belong to first column. 


Exercise: 


Problem: 
Compare the electron configurations of the elements that belong in the 
same row of the periodic table of elements. 

Problems 


Exercise: 


Problem: (a) How many electrons can be in the n = 4 shell? 


(b) What are its subshells, and how many electrons can be in each? 


Solution: 

a, 225-0 

L 2(2¢ + 1) 

0 s 2(0+1) = 2 
I op. 2(24e))- S06 
2 d 2(44+1) = 10 
3 f  2(6+1) 14 


Exercise: 


Problem: 


(a) What is the minimum value of | for a subshell that contains 11 
electrons? 


(b) If this subshell is in the n = 5 shell, what is the spectroscopic 
notation for this atom? 
Exercise: 
Problem: 
Unreasonable result. Which of the following spectroscopic notations 


are not allowed? (a) 5s! (b) 1d! (c) 4s? (d) 3p’ (e) 5g°. State which 
rule is violated for each notation that is not allowed. 


Solution: 


a. and e. are allowed; the others are not allowed. 

b. 1 = 3 not allowed forn = 1,1 < (n—1). 

c. Cannot have three electrons in s subshell because 
so2(2 Le) 

d. Cannot have seven electrons in p subshell (max of 6) 
2(21+1) =2(24+1) =6. 


Exercise: 


Problem: Write the electron configuration for potassium. 


Exercise: 


Problem: Write the electron configuration for iron. 


Solution: 


[Ar] 4s73d° 


Exercise: 


Problem: 


The valence electron of potassium is excited to a 5d state. (a) What is 
the magnitude of the electron’s orbital angular momentum? (b) How 
many states are possible along a chosen direction? 


Exercise: 
Problem: 
(a) If one subshell of an atom has nine electrons in it, what is the 


minimum value of I? (b) What is the spectroscopic notation for this 
atom, if this subshell is part of the n = 3 shell? 


Solution: 


a. The minimum value of @ is 1 = 2 to have nine electrons in it. 
b. 3d?. 


Exercise: 


Problem: Write the electron configuration for magnesium. 


Exercise: 


Problem: Write the electron configuration for carbon. 


Solution: 


[He] 2572p" 
Exercise: 


Problem: 


The magnitudes of the resultant spins of the electrons of the elements 
B through Ne when in the ground state are: 3/2, 2h, V15h/2, 


V 2h, V 3h /2, and 0, respectively. Argue that these spins are 
consistent with Hund’s rule. 


Glossary 


chemical group 
group of elements in the same column of the periodic table that 
possess similar chemical properties 


covalent bond 
chemical bond formed by the sharing of electrons between two atoms 


electron configuration 
representation of the state of electrons in an atom, such as 1872s! for 
lithium 


ionic bond 
chemical bond formed by the electric attraction between two 
oppositely charged ions 


Pauli’s exclusion principle 
no two electrons in an atom can have the same values for all four 
quantum numbers (n, J, m, m5) 


transition metal 
element that is located in the gap between the first two columns and 
the last six columns of the table of elements that contains electrons that 
fill the d subshell 


valence electron 
electron in the outer shell of an atom that participates in chemical 
bonding 


Atomic Spectra and X-rays 
By the end of this section, you will be able to: 


¢ Describe the absorption and emission of radiation in terms of atomic energy levels and 
energy differences 

e Use quantum numbers to estimate the energy, frequency, and wavelength of photons 
produced by atomic transitions in multi-electron atoms 

e Explain radiation concepts in the context of atomic fluorescence and X-rays 


The study of atomic spectra provides most of our knowledge about atoms. In modern science, 
atomic spectra are used to identify species of atoms in a range of objects, from distant 
galaxies to blood samples at a crime scene. 


The theoretical basis of atomic spectroscopy is the transition of electrons between energy 
levels in atoms. For example, if an electron in a hydrogen atom makes a transition from the 
n = 3 to the n = 2 shell, the atom emits a photon with a wavelength 

Equation: 


hee he _ he 
h-f AE EB3-—E,’ 


where AF = E73 — Ez is energy carried away by the photon and hc = 1940 eV - nm. After 
this radiation passes through a spectrometer, it appears as a sharp spectral line on a screen. 
The Bohr model of this process is shown in [link]. If the electron later absorbs a photon with 
energy AF, the electron returns to the n = 3 shell. (We examined the Bohr model earlier, in 
Photons and Matter Waves.) 
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An electron transition from the n = 3 to the n = 2 shell of a 
hydrogen atom. 


To understand atomic transitions in multi-electron atoms, it is necessary to consider many 
effects, including the Coulomb repulsion between electrons and internal magnetic interactions 
(spin-orbit and spin-spin couplings). Fortunately, many properties of these systems can be 
understood by neglecting interactions between electrons and representing each electron by its 
own single-particle wave function Wnim. 


Atomic transitions must obey selection rules. These rules follow from principles of quantum 
mechanics and symmetry. Selection rules classify transitions as either allowed or forbidden. 
(Forbidden transitions do occur, but the probability of the typical forbidden transition is very 
small.) For a hydrogen-like atom, atomic transitions that involve electromagnetic interactions 
(the emission and absorption of photons) obey the following selection rule: 


Note: 
Equation: 


Al = +1, 


where | is associated with the magnitude of orbital angular momentum, 
Equation: 


L=4/U(l+1)h. 


For multi-electron atoms, similar rules apply. To illustrate this rule, consider the observed 
atomic transitions in hydrogen (H), sodium (Na), and mercury (Hg) ([link]). The horizontal 
lines in this diagram correspond to atomic energy levels, and the transitions allowed by this 
selection rule are shown by lines drawn between these levels. The energies of these states are 
on the order of a few electron volts, and photons emitted in transitions are in the visible range. 
Technically, atomic transitions can violate the selection rule, but such transitions are 
uncommon. 
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(c) Mercury 


Energy-level diagrams for (a) hydrogen, (b) sodium, and (c) mercury. 
For comparison, hydrogen energy levels are shown in the sodium 
diagram. 


The hydrogen atom has the simplest energy-level diagram. If we neglect electron spin, all 
states with the same value of n have the same total energy. However, spin-orbit coupling splits 
the n = 2 states into two angular momentum states (s and p) of slightly different energies. 
(These levels are not vertically displaced, because the energy splitting is too small to show up 
in this diagram.) Likewise, spin-orbit coupling splits the n = 3 states into three angular 
momentum states (s, p, and d). 


The energy-level diagram for hydrogen is similar to sodium, because both atoms have one 
electron in the outer shell. The valence electron of sodium moves in the electric field of a 
nucleus shielded by electrons in the inner shells, so it does not experience a simple 1/r 
Coulomb potential and its total energy depends on both n and I. Interestingly, mercury has 
two separate energy-level diagrams; these diagrams correspond to two net spin states of its 6s 
(valence) electrons. 


Example: 

The Sodium Doublet 

The spectrum of sodium is analyzed with a spectrometer. Two closely spaced lines with 
wavelengths 589.00 nm and 589.59 nm are observed. (a) If the doublet corresponds to the 
excited (valence) electron that transitions from some excited state down to the 3s state, what 
was the original electron angular momentum? (b) What is the energy difference between 
these two excited states? 

Strategy 

Sodium and hydrogen belong to the same column or chemical group of the periodic table, so 
sodium is “hydrogen-like.” The outermost electron in sodium is in the 3s (J = 0) subshell 
and can be excited to higher energy levels. As for hydrogen, subsequent transitions to lower 
energy levels must obey the selection rule: 

Equation: 


A= 


We must first determine the quantum number of the initial state that satisfies the selection 
rule. Then, we can use this number to determine the magnitude of orbital angular momentum 
of the initial state. 

Solution 


a. Allowed transitions must obey the selection rule. If the quantum number of the initial 
state is 1 = 0, the transition is forbidden because Al = 0. If the quantum number of the 
initial state is ] = 2,3, 4,...the transition is forbidden because Al > 1. Therefore, the 
quantum of the initial state must be / = 1. The orbital angular momentum of the initial 
state is 
Equation: 


L= /\l+1)h=1.41h. 


b. Because the final state for both transitions is the same (3s), the difference in energies of 
the photons is equal to the difference in energies of the two excited states. Using the 
equation 
Equation: 


INSET I 


we have 
Equation: 


AE =he(t- +) 


= i115) 8 1 1 


— 2.11 x 10° eV. 


Significance 

To understand the difficulty of measuring this energy difference, we compare this difference 
with the average energy of the two photons emitted in the transition. Given an average 
wavelength of 589.30 nm, the average energy of the photons is 

Equation: 


ga Pe (4.14 x 10°! eVs)(3.00 x 10° m/s) 
A 589.30 x 10°9m 


=—Z1ey. 


The energy difference AF is about 0.1% (1 part in 1000) of this average energy. However, a 
sensitive spectrometer can measure the difference. 


Atomic Fluorescence 


Fluorescence occurs when an electron in an atom is excited several steps above the ground 
state by the absorption of a high-energy ultraviolet (UV) photon. Once excited, the electron 
“de-excites” in two ways. The electron can drop back to the ground state, emitting a photon of 
the same energy that excited it, or it can drop in a series of smaller steps, emitting several 
low-energy photons. Some of these photons may be in the visible range. Fluorescent dye in 
clothes can make colors seem brighter in sunlight by converting UV radiation into visible 
light. Fluorescent lights are more efficient in converting electrical energy into visible light 
than incandescent filaments (about four times as efficient). [link] shows a scorpion 
illuminated by a UV lamp. Proteins near the surface of the skin emit a characteristic blue 
light. 


A scorpion glows blue under a UV lamp. (credit: Ken Bosma) 


X-rays 


The study of atomic energy transitions enables us to understand X-rays and X-ray technology. 
Like all electromagnetic radiation, X-rays are made of photons. X-ray photons are produced 
when electrons in the outermost shells of an atom drop to the inner shells. (Hydrogen atoms 
do not emit X-rays, because the electron energy levels are too closely spaced together to 
permit the emission of high-frequency radiation.) Transitions of this kind are normally 
forbidden because the lower states are already filled. However, if an inner shell has a vacancy 
(an inner electron is missing, perhaps from being knocked away by a high-speed electron), an 
electron from one of the outer shells can drop in energy to fill the vacancy. The energy gap for 
such a transition is relatively large, so wavelength of the radiated X-ray photon is relatively 
short. 


X-rays can also be produced by bombarding a metal target with high-energy electrons, as 
shown in [link]. In the figure, electrons are boiled off a filament and accelerated by an electric 
field into a tungsten target. According to the classical theory of electromagnetism, any 
charged particle that accelerates emits radiation. Thus, when the electron strikes the tungsten 
target, and suddenly slows down, the electron emits braking radiation. (Braking radiation 
refers to radiation produced by any charged particle that is slowed by a medium.) In this case, 
braking radiation contains a continuous range of frequencies, because the electrons will 
collide with the target atoms in slightly different ways. 


Braking radiation is not the only type of radiation produced in this interaction. In some cases, 
an electron collides with another inner-shell electron of a target atom, and knocks the electron 
out of the atom—billiard ball style. The empty state is filled when an electron in a higher 
shell drops into the state (drop in energy level) and emits an X-ray photon. 


Electron beam 
from cathode Tungsten target 
Heated filament 


Cathode 


Ss Emitted x-rays 
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A sketch of an X-ray tube. X-rays are emitted from the 
tungsten target. 


Historically, X-ray spectral lines were labeled with letters (K, L, M, N, ...). These letters 
correspond to the atomic shells (n = 1, 2,3,4,...). X-rays produced by a transition from any 
higher shell to the K (n = 1) shell are labeled as K X-rays. X-rays produced in a transition 
from the L (n = 2) shell are called K,, X-rays; X-rays produced in a transition from the M ( 
n = 3) shell are called Kg X-rays; X-rays produced in a transition from the N (n = 4) shell 
are called AK X-rays; and so forth. Transitions from higher shells to L and M shells are 
labeled similarly. These transitions are represented by an energy-level diagram in [link]. 


n=4 
(N shell) 
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(M shell) 
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(L shell) 
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(K shell) 


K series 


X-ray transitions in an atom. 


The distribution of X-ray wavelengths produced by striking metal with a beam of electrons is 
given in [link]. X-ray transitions in the target metal appear as peaks on top of the braking 
radiation curve. Photon frequencies corresponding to the spikes in the X-ray distribution are 
called characteristic frequencies, because they can be used to identify the target metal. The 
sharp cutoff wavelength (just below the A peak) corresponds to an electron that loses all of 
its energy to a single photon. Radiation of shorter wavelengths is forbidden by the 
conservation of energy. 


Intensity 


0.01 0.1 1.0 
Wavelength (nm) 


X-ray spectrum from a silver target. The peaks correspond to characteristic 
frequencies of X-rays emitted by silver when struck by an electron beam. 


Example: 

X-Rays from Aluminum 

Estimate the characteristic energy and frequency of the AK, X-ray for aluminum (Z = 13). 
Strategy 

A kK, X-ray is produced by the transition of an electron in the L (n = 2) shell to the K ( 

nm = 1) shell. An electron in the L shell “sees” a charge Z = 13 — 1 = 12, because one 
electron in the K shell shields the nuclear charge. (Recall, two electrons are not in the K shell 
because the other electron state is vacant.) The frequency of the emitted photon can be 
estimated from the energy difference between the L and K shells. 

Solution 

The energy difference between the L and K shells in a hydrogen atom is 10.2 eV. Assuming 
that other electrons in the L shell or in higher-energy shells do not shield the nuclear charge, 
the energy difference between the L and K shells in an atom with Z = 13 is approximately 
Equation: 


AE.+.« © (Z—1)7(10.2 eV) = (13 — 1)? (10.2 eV) = 1.47 x 10%eV. 


Based on the relationship f = (AEz_,x) /h, the frequency of the X-ray is 
Equation: 


1.47 x 10°eV 
f= 85 10! ai. 
4.14 x 10 »eV-s 


Significance 
The wavelength of the typical X-ray is 0.1-10 nm. In this case, the wavelength is: 
Equation: 


, 108 
ee 3.0 x 10'm/s_ SR Se ee Sc. 


f 3.55 x 10!7 Hz 


Hence, the transition L—K in aluminum produces X-ray radiation. 


X-ray production provides an important test of quantum mechanics. According to the Bohr 
model, the energy of a K, X-ray depends on the nuclear charge or atomic number, Z. If Z is 
large, Coulomb forces in the atom are large, energy differences (AF) are large, and, 
therefore, the energy of radiated photons is large. To illustrate, consider a single electron in a 
multi-electron atom. Neglecting interactions between the electrons, the allowed energy levels 
are 

Equation: 


where n = 1, 2, ...and Z is the atomic number of the nucleus. However, an electron in the L ( 
n = 2) shell “sees” a charge Z — 1, because one electron in the K shell shields the nuclear 
charge. (Recall that there is only one electron in the K shell because the other electron was 
“knocked out.”) Therefore, the approximate energies of the electron in the L and K shells are 
Equation: 
2 
eS (G1) Gssey) 


2 
ee (Z-1) ase eV) 


The energy carried away by a photon in a transition from the L shell to the K shell is therefore 
Equation: 


where Z is the atomic number. In general, the X-ray photon energy for a transition from an 
outer shell to the K shell is 
Equation: 


AE,_,.xn = hf =constant x (Z — 1), 


or 


Note: 
Equation: 


(Z — 1) =constant,/f, 


where f is the frequency of a K, X-ray. This equation is Moseley’s law. For large values of Z, 
we have approximately 
Equation: 


~~ constant +/ i 


This prediction can be checked by measuring f for a variety of metal targets. This model is 
supported if a plot of Z versus / 7 data (called a Moseley plot) is linear. Comparison of 
model predictions and experimental results, for both the K and L series, is shown in [link]. 
The data support the model that X-rays are produced when an outer shell electron drops in 
energy to fill a vacancy in an inner shell. 


Note: 
Exercise: 


Problem: 

Check Your Understanding X-rays are produced by bombarding a metal target with 
high-energy electrons. If the target is replaced by another with two times the atomic 
number, what happens to the frequency of X-rays? 


Solution: 


frequency quadruples 


Moseley Plot of Characteristic X-Rays 
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A Moseley plot. These data were adapted from 
Moseley’s original data (H. G. J. Moseley, Philos. Mag. 
(6) 77:703, 1914). 


Example: 

Characteristic X-Ray Energy 

Calculate the approximate energy of a K, X-ray from a tungsten anode in an X-ray tube. 
Strategy 


Two electrons occupy a filled K shell. A vacancy in this shell would leave one electron, so 
the effective charge for an electron in the L shell would be Z — 1 rather than Z. For tungsten, 
Z = TA, so the effective charge is 73. This number can be used to calculate the energy-level 
difference between the L and K shells, and, therefore, the energy carried away by a photon in 
the transition L > K. 


Solution 
The effective Z is 73, so the K, X-ray energy is given by 
Equation: 
Ex, =ABR=E£, - k= E,- fi, 
where 
Equation: 
Ze (Ga 
and 
Equation: 
Z? (Ee 
Eo = ~ Sy = — (18.6 eV) = —18.1keV. 
Thus, 
Equation: 
Ex, = —18.1 keV — (—72.5 keV) = 54.4 keV. 
Significance 


This large photon energy is typical of X-rays. X-ray energies become progressively larger for 
heavier elements because their energy increases approximately as Z?. An acceleration 
voltage of more than 50,000 volts is needed to “knock out” an inner electron from a tungsten 
atom. 


X-ray Technology 


X-rays have many applications, such as in medical diagnostics ([link]), inspection of luggage 
at airports ([link]), and even detection of cracks in crucial aircraft components. The most 
common X-ray images are due to shadows. Because X-ray photons have high energy, they 
penetrate materials that are opaque to visible light. The more energy an X-ray photon has, the 
more material it penetrates. The depth of penetration is related to the density of the material, 
as well as to the energy of the photon. The denser the material, the fewer X-ray photons get 
through and the darker the shadow. X-rays are effective at identifying bone breaks and 
tumors; however, overexposure to X-rays can damage cells in biological organisms. 


(b) 


(a) An X-ray image of a person’s teeth. (b) A typical X-ray machine in a dentist’s office 
produces relatively low-energy radiation to minimize patient exposure. (credit a: 
modification of work by “Dmitry G”/Wikimedia Commons) 


An X-ray image of a piece of luggage. The denser the 
material, the darker the shadow. Object colors relate to 
material composition—metallic objects show up as 
blue in this image. (credit: “[Duke”/Wikimedia 
Commons) 


A standard X-ray image provides a two-dimensional view of the object. However, in medical 
applications, this view does not often provide enough information to draw firm conclusions. 
For example, in a two-dimensional X-ray image of the body, bones can easily hide soft tissues 
or organs. The CAT (computed axial tomography) scanner addresses this problem by 
collecting numerous X-ray images in “slices” throughout the body. Complex computer-image 
processing of the relative absorption of the X-rays, in different directions, can produce a 
highly detailed three-dimensional X-ray image of the body. 


X-rays can also be used to probe the structures of atoms and molecules. Consider X-rays 
incident on the surface of a crystalline solid. Some X-ray photons reflect at the surface, and 
others reflect off the “plane” of atoms just below the surface. Interference between these 
photons, for different angles of incidence, produces a beautiful image on a screen ({link]). The 
interaction of X-rays with a solid is called X-ray diffraction. The most famous example using 
X-ray diffraction is the discovery of the double-helix structure of DNA. 


. 


X-ray diffraction from the crystal of a protein (hen egg lysozyme) produced this 
interference pattern. Analysis of the pattern yields information about the structure of 
the protein. (credit: “Del45”/Wikimedia Commons) 


Summary 


e Radiation is absorbed and emitted by atomic energy-level transitions. 
¢ Quantum numbers can be used to estimate the energy, frequency, and wavelength of 
photons produced by atomic transitions. 


e Atomic fluorescence occurs when an electron in an atom is excited several steps above 
the ground state by the absorption of a high-energy ultraviolet (UV) photon. 

e X-ray photons are produced when a vacancy in an inner shell of an atom is filled by an 
electron from the outer shell of the atom. 

e The frequency of X-ray radiation is related to the atomic number Z of an atom. 


Conceptual Questions 


Exercise: 
Problem: 
Atomic and molecular spectra are discrete. What does discrete mean, and how are 


discrete spectra related to the quantization of energy and electron orbits in atoms and 
molecules? 


Solution: 


Atomic and molecular spectra are said to be “discrete,” because only certain spectral 
y 
lines are observed. In contrast, spectra from a white light source (consisting of man 
y 
photon frequencies) are continuous because a continuous “rainbow” of colors is 
observed. 


Exercise: 
Problem: 
Discuss the process of the absorption of light by matter in terms of the atomic structure 
of the absorbing medium. 

Exercise: 
Problem: 
NGC1763 is an emission nebula in the Large Magellanic Cloud just outside our Milky 
Way Galaxy. Ultraviolet light from hot stars ionize the hydrogen atoms in the nebula. As 


protons and electrons recombine, light in the visible range is emitted. Compare the 
energies of the photons involved in these two transitions. 


Solution: 


UV light consists of relatively high frequency (short wavelength) photons. So the energy 
of the absorbed photon and the energy transition (AF) in the atom is relatively large. In 
comparison, visible light consists of relatively lower-frequency photons. Therefore, the 
energy transition in the atom and the energy of the emitted photon is relatively small. 


Exercise: 


Problem: 
Why are X-rays emitted only for electron transitions to inner shells? What type of 
photon is emitted for transitions between outer shells? 
Exercise: 
Problem: 


How do the allowed orbits for electrons in atoms differ from the allowed orbits for 
planets around the sun? 


Solution: 


For macroscopic systems, the quantum numbers are very large, so the energy difference ( 
AE) between adjacent energy levels (orbits) is very small. The energy released in 
transitions between these closely space energy levels is much too small to be detected. 


Problems 


Exercise: 


Problem: 


What is the minimum frequency of a photon required to ionize: (a) a He* ion in its 
ground state? (b) A Li?" ion in its first excited state? 


Solution: 


For He’*, one electron “orbits” a nucleus with two protons and two neutrons (Z = 2). 
Ionization energy refers to the energy required to remove the electron from the atom. 
The energy needed to remove the electron in the ground state of He* ion to infinity is 
negative the value of the ground state energy, written: 

E= —544eV. 

Thus, the energy to ionize the electron is +54.4 eV. 

Similarly, the energy needed to remove an electron in the first excited state of Li?" ion to 
infinity is negative the value of the first excited state energy, written: 

E = —30.6 eV. 

The energy to ionize the electron is 30.6 eV. 


Exercise: 
Problem: 
The ion Li?* makes an atomic transition from an n = 4 state to ann = 2 state. (a) What 


is the energy of the photon emitted during the transition? (b) What is the wavelength of 
the photon? 


Exercise: 


Problem: 


The red light emitted by a ruby laser has a wavelength of 694.3 nm. What is the 
difference in energy between the initial state and final state corresponding to the 
emission of the light? 


Solution: 


The wavelength of the laser is given by: 


~ -AFE? 
where E, is the energy of the photon and AF is the magnitude of the energy difference. 


Solving for the latter, we get: 
AE = —2.795 eV. 
The negative sign indicates that the electron lost energy in the transition. 


Exercise: 


Problem: 


The yellow light from a sodium-vapor street lamp is produced by a transition of sodium 
atoms from a 3p state to a 3s state. If the difference in energies of those two states is 2.10 
eV, what is the wavelength of the yellow light? 


Exercise: 


Problem: Estimate the wavelength of the A, X-ray from calcium. 
Solution: 


AE, yx © (Z— 1)°(10.2 eV) = 3.68 x 103 eV. 


Exercise: 


Problem: Estimate the frequency of the K, X-ray from cesium. 
Exercise: 


Problem: 


X-rays are produced by striking a target with a beam of electrons. Prior to striking the 
target, the electrons are accelerated by an electric field through a potential energy 
difference: 


AU = —eAV, 


where e is the charge of an electron and AV is the voltage difference. If AV = 15,000 
volts, what is the minimum wavelength of the emitted radiation? 


Solution: 


According to the conservation of the energy, the potential energy of the electron is 
converted completely into kinetic energy. The initial kinetic energy of the electron is 
zero (the electron begins at rest). So, the kinetic energy of the electron just before it 
strikes the target is: 

k =eAV. 

If all of this energy is converted into braking radiation, the frequency of the emitted 


radiation is a maximum, therefore: 
—_ eAV 


max ~_ h 
When the emitted frequency is a maximum, then the emitted wavelength is a minimum, 
SO: 
Amin = 0.1293 nm. 


Exercise: 


Problem: 
For the preceding problem, what happens to the minimum wavelength if the voltage 
across the X-ray tube is doubled? 
Exercise: 
Problem: 


Suppose the experiment in the preceding problem is conducted with muons. What 
happens to the minimum wavelength? 


Solution: 
A muon is 200 times heavier than an electron, but the minimum wavelength does not 
depend on mass, so the result is unchanged. 

Exercise: 
Problem: 
An X-ray tube accelerates an electron with an applied voltage of 50 kV toward a metal 
target. (a) What is the shortest-wavelength X-ray radiation generated at the target? (b) 


Calculate the photon energy in eV. (c) Explain the relationship of the photon energy to 
the applied voltage. 


Exercise: 
Problem: 
A color television tube generates some X-rays when its electron beam strikes the screen. 
What is the shortest wavelength of these X-rays, if a 30.0-kV potential is used to 


accelerate the electrons? (Note that TVs have shielding to prevent these X-rays from 
exposing viewers.) 


Solution: 


4.13% 10°" m 
Exercise: 
Problem: 
An X-ray tube has an applied voltage of 100 kV. (a) What is the most energetic X-ray 


photon it can produce? Express your answer in electron volts and joules. (b) Find the 
wavelength of such an X-ray. 


Exercise: 
Problem: 
The maximum characteristic X-ray photon energy comes from the capture of a free 


electron into a K shell vacancy. What is this photon energy in keV for tungsten, 
assuming that the free electron has no initial kinetic energy? 


Solution: 


72.5 keV 


Exercise: 


Problem: What are the approximate energies of the Ky and Kg X-rays for copper? 


Exercise: 


Problem: Compare the X-ray photon wavelengths for copper and gold. 
Solution: 


The atomic numbers for Cu and Au are Z = 29 and 79, respectively. The X-ray photon 
frequency for gold is greater than copper by a factor: 


2 
fau \" _ (79-1)? — 

(4) a (35-7) ~ 8. 

Therefore, the X-ray wavelength of Au is about eight times shorter than for copper. 
Exercise: 

Problem: 

The approximate energies of the A, and Kg X-rays for copper are Ex, = 8.00 keV and 

Ex, = 9.48 keV, respectively. Determine the ratio of X-ray frequencies of gold to 


copper, then use this value to estimate the corresponding energies of Ay and Kg X-rays 
for gold. 


Glossary 


braking radiation 
radiation produced by targeting metal with a high-energy electron beam (or radiation 
produced by the acceleration of any charged particle in a material) 


fluorescence 
radiation produced by the excitation and subsequent, gradual de-excitation of an electron 
in an atom 


Moseley’s law 
relationship between the atomic number and X-ray photon frequency for X-ray 
production 


Moseley plot 
plot of the atomic number versus the square root of X-ray frequency 


selection rules 
rules that determine whether atomic transitions are allowed or forbidden (rare) 


Lasers 
By the end of this section, you will be able to: 


e Describe the physical processes necessary to produce laser light 
e Explain the difference between coherent and incoherent light 
¢ Describe the application of lasers to a CD and Blu-Ray player 


A laser is device that emits coherent and monochromatic light. The light is 
coherent if photons that compose the light are in-phase, and 
monochromatic if the photons have a single frequency (color). When a gas 
in the laser absorbs radiation, electrons are elevated to different energy 
levels. Most electrons return immediately to the ground state, but others 
linger in what is called a metastable state. It is possible to place a majority 
of these atoms in a metastable state, a condition called a population 
inversion. 


When a photon of energy disturbs an electron in a metastable state ([link]), 
the electron drops to the lower-energy level and emits an addition photon, 
and the two photons proceed off together. This process is called stimulated 
emission. It occurs with relatively high probability when the energy of the 
incoming photon is equal to the energy difference between the excited and 
“de-excited” energy levels of the electron (AE = hf). Hence, the 
incoming photon and the photon produced by de-excitation have the same 
energy, hf. These photons encounter more electrons in the metastable state, 
and the process repeats. The result is a cascade or chain reaction of similar 
de-excitations. Laser light is coherent because all light waves in laser light 
share the same frequency (color) and the same phase (any two points of 
along a line perpendicular to the direction of motion are on the “same part” 
of the wave”). A schematic diagram of coherent and incoherent light wave 
pattern is given in [link]. 


Electrons in a metastable state 


aa 
aaa 


= 
= 


The physics of a laser. An incident photon of frequency f causes a 
cascade of photons of the same frequency. 


Coherent light wave pattern Incoherent light wave pattern 


A coherent light wave pattern contains light waves of the same 
frequency and phase. An incoherent light wave pattern contains light 
waves of different frequencies and phases. 


Lasers are used in a wide range of applications, such as in communication 
(optical fiber phone lines), entertainment (laser light shows), medicine 
(removing tumors and cauterizing vessels in the retina), and in retail sales 
(bar code readers). Lasers can also be produced by a large range of 
materials, including solids (for example, the ruby crystal), gases (helium- 
gas mixture), and liquids (organic dyes). Recently, a laser was even created 


using gelatin—an edible laser! Below we discuss two practical applications 
in detail: CD players and Blu-Ray Players. 


CD Player 


A CD player reads digital information stored on a compact disc (CD). A 
CD is 6-inch diameter disc made of plastic that contains small “bumps” and 
“pits” nears its surface to encode digital or binary data ({link]). The bumps 
and pits appear along a very thin track that spirals outwards from the center 
of the disc. The width of the track is smaller than 1/20th the width of a 
human hair, and the heights of the bumps are even smaller yet. 


; Lacquer 
Pit Bump 


Aluminum 


A compact disc is a plastic disc that uses bumps near its 
surface to encode digital information. The surface of the 
disc contains multiple layers, including a layer of aluminum 
and one of polycarbonate plastic. 


A CD player uses a laser to read this digital information. Laser light is 
suited to this purpose, because coherent light can be focused onto an 
incredibly small spot and therefore distinguish between bumps and pits in 
the CD. After processing by player components (including a diffraction 
grating, polarizer, and collimator), laser light is focused by a lens onto the 
CD surface. Light that strikes a bump (“land”) is merely reflected, but light 
that strikes a “pit” destructively interferes, so no light returns (the details of 
this process are not important to this discussion). Reflected light is 
interpreted as a “1” and unreflected light is interpreted as a “0.” The 
resulting digital signal is converted into an analog signal, and the analog 
signal is fed into an amplifier that powers a device such as a pair of 
headphones. The laser system of a CD player is shown in [link]. 


Laser 


A CD player and its laser component. 


Blu-Ray Player 


Like a CD player, a Blu-Ray player reads digital information (video or 
audio) stored on a disc, and a laser is used to record this information. The 
pits on a Blu-Ray disc are much smaller and more closely packed together 
than for a CD, so much more information can be stored. As a result, the 
resolving power of the laser must be greater. This is achieved using short 
wavelength (A = 405 nm) blue laser light—hence, the name “Blu-” Ray. 
(CDs and DVDs use red laser light.) The different pit sizes and player- 
hardware configurations of a CD, DVD, and Blu-Ray player are shown in 
[link]. The pit sizes of a Blu-Ray disk are more than twice as small as the 
pits on a DVD or CD. Unlike a CD, a Blu-Ray disc store data on a 
polycarbonate layer, which places the data closer to the lens and avoids 
readability problems. A hard coating is used to protect the data since it is so 
close to the surface. 


CD: 0.7 GB DVD: 4.7 GB Blu-Ray: 25 GB 


Laser Laser 
beam beam 
0.6 mm} 
A = 780 nm A = 650 nm A = 405 nm 


Comparison of laser resolution in a CD, DVD, and Blu-Ray Player. 


Summary 


e Laser light is coherent (monochromatic and “phase linked”) light. 


e Laser light is produced by population inversion and subsequent de- 
excitation of electrons in a material (solid, liquid, or gas). 
¢ CD and Blu-Ray players uses lasers to read digital information stored 


on discs. 


Key Equations 


Orbital angular momentum 


z-component of orbital angular 
momentum 


Radial probability density function 
Spin angular momentum 


z-component of spin angular 
momentum 


Electron spin magnetic moment 


Electron orbital magnetic dipole 
moment 


Potential energy associated with the 
magnetic 

interaction between the orbital 
magnetic dipole 

moment and an external magnetic 


field B 


L=VJ/l(l+1)h 
L,=mh 
P(r)dr = |{noo|?4ar?dr 


S=/s(s+1)i 


Maximum number of electrons in a N=41+2 
subshell of 
a hydrogen atom 


Selection rule for atomic transitions in 
a Al = +1 
hydrogen-like atom 


Moseley’s law for X-ray production (Z — 1) = constant J f 


Conceptual Questions 


Exercise: 


Problem: Distinguish between coherent and monochromatic light. 
Exercise: 


Problem: 


Why is a metastable state necessary for the production of laser light? 


Solution: 


Laser light relies on the process of stimulated emission. In this process, 
electrons must be prepared in an excited (upper) metastable state such 
that the passage of light through the system produces de-excitations 
and, therefore, additional light. 


Exercise: 


Problem: 


How does light from an incandescent light bulb differ from laser light? 


Exercise: 


Problem: 


How is a Blu-Ray player able to read more information that a CD 
player? 


Solution: 


A Blu-Ray player uses blue laser light to probe the bumps and pits of 
the disc and a CD player uses red laser light. The relatively short- 
wavelength blue light is necessary to probe the smaller pits and bumps 
on a Blu-ray disc; smaller pits and bumps correspond to higher storage 
densities. 


Exercise: 
Problem: 


What are the similarities and differences between a CD player and a 
Blu-Ray player? 


Problems 


Exercise: 


Problem: 


A carbon dioxide laser used in surgery emits infrared radiation with a 
wavelength of 10.6 zm. In 1.00 ms, this laser raised the temperature of 
1.00 cm? of flesh to 100 °C and evaporated it. (a) How many photons 
were required? You may assume that flesh has the same heat of 
vaporization as water. (b) What was the minimum power output during 
the flash? 


Solution: 


: 23 
a. If flesh has the same density as water, then we used 14 * 10 


photons. b. 2.52 MW 


Exercise: 


Problem: 


An excimer laser used for vision correction emits UV radiation with a 
wavelength of 193 nm. (a) Calculate the photon energy in eV. (b) 
These photons are used to evaporate corneal tissue, which is very 
similar to water in its properties. Calculate the amount of energy 
needed per molecule of water to make the phase change from liquid to 
gas. That is, divide the heat of vaporization in kJ/kg by the number of 
water molecules in a kilogram. (c) Convert this to eV and compare to 
the photon energy. Discuss the implications. 


Additional Problems 


Exercise: 
Problem: 
For a hydrogen atom in an excited state with principal quantum 


number n, show that the smallest angle that the orbital angular 
momentum vector can make with respect to the z-axis is 


6 = cos"! ( a), 


Solution: 


The smallest angle corresponds tol = n —-1landm=l=n-—1. 
1 n-1 


n 


Therefore 6 = cos 


Exercise: 
Problem: 
What is the probability that the 1s electron of a hydrogen atom is 
found between r = 0 andr = cw? 


Exercise: 


Problem: 


Sketch the potential energy function of an electron in a hydrogen atom. 
(a) What is the value of this function at r = 0? in the limit that r = oo 
? (b) What is unreasonable or inconsistent with the former result? 


Solution: 


a. According to [link], when r = 0, U (r) = —oo, and when 

r = +o0,U (r) = 0. b. The former result suggests that the electron 
can have an infinite negative potential energy. The quantum model of 
the hydrogen atom avoids this possibility because the probability 
density at r = 0 is zero. 


Exercise: 


Problem: 


Find the value of /, the orbital angular momentum quantum number, 
for the Moon around Earth. 


Exercise: 


Problem: 


Show that the maximum number of orbital angular momentum 
electron states in the nth shell of an atom is n. (Ignore electron spin.) 
(Hint: Make a table of the total number of orbital angular momentum 
states for each shell and find the pattern.) 


Solution: 


A formal solution using sums is somewhat complicated. However, the 
answer easily found by studying the mathematical pattern between the 
principal quantum number and the total number of orbital angular 
momentum states. 

For n = 1, the total number of orbital angular momentum states is 1; 
for n = 2, the total number is 4; and, when n = 3, the total number is 
9, and so on. The pattern suggests the total number of orbital angular 


momentum states for the nth shell is n?. 

(Later, when we consider electron spin, the total number of angular 
momentum states will be found to be 2n?, because each orbital angular 
momentum states is associated with two states of electron spin; spin up 
and spin down). 


Exercise: 


Problem: What is the magnitude of an electron magnetic moment? 
Exercise: 


Problem: 
What is the maximum number of electron states in the n = 5 shell? 


Solution: 


50 
Exercise: 
Problem: 
A ground-state hydrogen atom is placed in a uniform magnetic field, 
and a photon is emitted in the transition from a spin-up to spin-down 


state. The wavelength of the photon is 168 zm. What is the strength of 
the magnetic field? 


Exercise: 


Problem: 


Show that the maximum number of electron states in the nth shell of 
an atom is 2n?. 


Solution: 


The maximum number of orbital angular momentum electron states in 
the nth shell of an atom is n?. Each of these states can be filled by a 


spin up and spin down electron. So, the maximum number of electron 
states in the nth shell is 2n?. 


Exercise: 


Problem: 


The valence electron of chlorine is excited to a 3p state. (a) What is the 
magnitude of the electron’s orbital angular momentum? (b) What are 
possible values for the z-component of angular measurement? 


Exercise: 


Problem: 


Which of the following notations are allowed (that is, which violate 
none of the rules regarding values of quantum numbers)? (a) 1s?; (b) 
1d°; (c) 4s?; (d) 3p"; (e) 6h? 


Solution: 


a., c., and e. are allowed; the others are not allowed. b. 1 > 7 is not 
allowed. 
d.7 > 2(21 4+ 1) 


Exercise: 


Problem: 


The ion Be®* makes an atomic transition from an n = 3 state to an 
nm = 2 state. (a) What is the energy of the photon emitted during the 
transition? (b) What is the wavelength of the photon? 


Exercise: 


Problem: 


The maximum characteristic X-ray photon energy comes from the 
capture of a free electron into a K shell vacancy. What is this photon 
frequency for tungsten, assuming that the free electron has no initial 
kinetic energy? 


Solution: 


f=1.8 x 10° Hz 
Exercise: 
Problem: 
Derive an expression for the ratio of X-ray photon frequency for two 
elements with atomic numbers Z; and Zo. 
Exercise: 


Problem: 


Compare the X-ray photon wavelengths for copper and silver. 


Solution: 


The atomic numbers for Cu and Ag are Z = 29 and 47, respectively. 
The X-ray photon frequency for silver is greater than copper by the 
following factor: 


2 
Sag \ 
( i ) ve 
Therefore, the X-ray wavelength of Ag is about three times shorter 
than for copper. 


Exercise: 
Problem: 
(a) What voltage must be applied to an X-ray tube to obtain 0.0100- 
fm-wavelength X-rays for use in exploring the details of nuclei? (b) 


What is unreasonable about this result? (c) Which assumptions are 
unreasonable or inconsistent? 


Exercise: 


Problem: 


A student in a physics laboratory observes a hydrogen spectrum with a 
diffraction grating for the purpose of measuring the wavelengths of the 
emitted radiation. In the spectrum, she observes a yellow line and finds 
its wavelength to be 589 nm. (a) Assuming that this is part of the 
Balmer series, determine n;, the principal quantum number of the 
initial state. (b) What is unreasonable about this result? (c) Which 
assumptions are unreasonable or inconsistent? 


Solution: 


a. 3.24; b. n; is not an integer. c. The wavelength must not be correct. 
Because n; > 2, the assumption that the line was from the Balmer 
series is possible, but the wavelength of the light did not produce an 
integer value for n;. If the wavelength is correct, then the assumption 
that the gas is hydrogen is not correct; it might be sodium instead. 


Glossary 


coherent light 
light that consists of photons of the same frequency and phase 


laser 
coherent light produced by a cascade of electron de-excitations 


metastable state 
state in which an electron “lingers” in an excited state 


monochromatic 
light that consists of photons with the same frequency 


population inversion 
condition in which a majority of atoms contain electrons in a 
metastable state 


stimulated emission 


when a photon of energy triggers an electron in a metastable state to 
drop in energy emitting an additional photon 


Introduction 
class="introduction" 


The 
crystalline 
structure of 

quartz allows 
it to cleave 
into smooth 
planes that 
refract light, 
making it 
suitable for 
jewelry. 
Silicon, the 
main element 
in quartz, also 
forms crystals 
in its pure 
form, and 
these crystals 
form the basis 
for the 
worldwide 
semiconducto 
r electronics 
industry. 
(credit left: 
modification 
of work by 
the United 
States 
Geological 
Survey) 


In this chapter, we examine applications of quantum mechanics to more 
complex systems, such as molecules, metals, semiconductors, and 
superconductors. We review and develop concepts of the previous chapters, 
including wave functions, orbitals, and quantum states. We also introduce 
many new concepts, including covalent bonding, rotational energy levels, 
Fermi energy, energy bands, doping, and Cooper pairs. 


The main topic in this chapter is the crystal structure of solids. For 
centuries, crystalline solids have been prized for their beauty, including 
gems like diamonds and emeralds, as well as geological crystals of quartz 
and metallic ores. But the crystalline structures of semiconductors such as 
silicon have also made possible the electronics industry of today. In this 
chapter, we study how the structures of solids give them properties from 
strength and transparency to electrical conductivity. 


Types of Molecular Bonds 
By the end of this section, you will be able to: 


¢ Distinguish between the different types of molecular bonds 

e Determine the dissociation energy of a molecule using the concepts 
ionization energy, electron affinity, and Coulomb force 

¢ Describe covalent bonding in terms of exchange symmetry 

e Explain the physical structure of a molecule in terms of the concept of 
hybridization 


Quantum mechanics has been extraordinarily successful at explaining the 
structure and bonding in molecules, and is therefore the foundation for all of 
chemistry. Quantum chemistry, as it is sometimes called, explains such basic 
questions as why H2O molecules exist, why the bonding angle between hydrogen 
atoms in this molecule is precisely 104.5°, and why these molecules bind 
together to form liquid water at room temperature. Applying quantum mechanics 
to molecules can be very difficult mathematically, so our discussion will be 
qualitative only. 


As we study molecules and then solids, we will use many different scientific 
models. In some cases, we look at a molecule or crystal as a set of point nuclei 
with electrons whizzing around the outside in well-defined trajectories, as in the 
Bohr model. In other cases, we employ our full knowledge of quantum mechanics 
to study these systems using wave functions and the concept of electron spin. It is 
important to remember that we study modern physics with models, and that 
different models are useful for different purposes. We do not always use the most 
powerful model, when a less-powerful, easier-to-use model will do the job. 


Types of Bonds 


Chemical units form by many different kinds of chemical bonds. An ionic bond 
forms when an electron transfers from one atom to another. A covalent bond 
occurs when two or more atoms share electrons. A van der Waals bond occurs 
due to the attraction of charge-polarized molecules and is considerably weaker 
than ionic or covalent bonds. Many other types of bonding exist as well. Often, 
bonding occurs via more than one mechanism. The focus of this section is ionic 
and covalent bonding. 


Ionic bonds 


The ionic bond is perhaps the easiest type of bonding to understand. It explains 
the formation of salt compounds, such as sodium chloride, NaCl. The sodium 
atom (symbol Na) has the same electron arrangement as a neon atom plus one 3s 
electron. Only 5.14 eV of energy is required to remove this one electron from the 
sodium atom. Therefore, Na can easily give up or donate this electron to an 
adjacent (nearby) atom, attaining a more stable arrangement of electrons. 
Chlorine (symbol Cl) requires just one electron to complete its valence shell, so it 
readily accepts this electron if it is near the sodium atom. We therefore say that 
chlorine has a large electron affinity, which is the energy associated with an 
accepted electron. The energy given up by the chlorine atom in this process is 
3.62 eV. After the electron transfers from the sodium atom to the chlorine atom, 
the sodium atom becomes a positive ion and the chlorine atom becomes a 
negative ion. The total energy required for this transfer is given by 

Equation: 


Etransfer = 5.14eV — 3.62 eV = 1.52 eV. 


The positive sodium ion and negative chloride ion experience an attractive 
Coulomb force. The potential energy associated with this force is given by 


Note: 
Equation: 


ke? 
U sept ae 
TO 


where ke? = 1.440 eV-nm and 779 is the distance between the ions. 


As the sodium and chloride ions move together (“descend the potential energy 
hill”), the force of attraction between the ions becomes stronger. However, if the 
ions become too close, core-electron wave functions in the two ions begin to 
overlap. Due to the exclusion principle, this action promotes the core electrons— 
and therefore the entire molecule—into a higher energy state. The equilibrium 


separation distance (or bond length) between the ions occurs when the molecule 
is in its lowest energy state. For diatomic NaCl, this distance is 0.236 nm. [link] 
shows the total energy of NaCl as a function of the distance of separation between 
ions. 


Pauli repulsion 


Energy (eV) 


Coulomb potential 


| I I | | 
0 0.2 0.4 0.6 0.8 10 


Separation (nm) 
Graph of energy versus ionic separation for sodium 


chloride. Equilibrium separation occur when the 
total energy is a minimum (—4.36 eV). 


The total energy required to form a single salt unit is 


Note: 
Equation: 


Oe = i eaneter = Olen =e Cee 


where U,x is the energy associated with the repulsion between core electrons due 
to Pauli’s exclusion principle. The value of U¢orm must be negative for the bond to 
form spontaneously. The dissociation energy is defined as the energy required to 
separate the unit into its constituent ions, written 

Equation: 


U dike = =Uteen 


Every diatomic formula unit has its own characteristic dissociation energy and 
equilibrium separation length. Sample values are given in [link]. 


Dissociation Equilibrium Separation (nm) 
Molecule Energy(eV) (Bond length) 
NaCl 4.26 0.236 
NaF 4.99 0.193 
NaBr 3.8 0.250 
Nal 3.1 0.271 
NaH 2.08 0.189 
LiCl 4.86 0.202 
Li 2.47 0.239 
Lil 3.67 0.238 
KCl 4.43 0.267 


KBr 3.97 0.282 


Dissociation Equilibrium Separation (nm) 


Molecule Energy(eV) (Bond length) 
RbF 5.12 0.227 
RbCl 4.64 0.279 
CsI 3.57 0.337 
H-H 4.5 0.075 
N-N 9.8 0.11 
O-O 5.2 0.12 
F-F 1.6 0.14 
Cl-Cl 2.5 0.20 

Bond Length 

Example: 

The Energy of Salt 


What is the dissociation energy of a salt formula unit (NaCl)? 

Strategy 

Sodium chloride (NaCl) is a salt formed by ionic bonds. The energy change 
associated with this bond depends on three main processes: the ionization of Na; 
the acceptance of the electron from a Na atom by a Cl atom; and Coulomb 
attraction of the resulting ions (Na‘ and Cl). If the ions get too close, they 
repel due to the exclusion principle (0.32 eV). The equilibrium separation 
distance is rg = 0.236 nm. 

Solution 

The energy change associated with the transfer of an electron from Na to Cl is 
1.52 eV, as discussed earlier in this section. At equilibrium separation, the atoms 
are ro = 0.236 nm apart. The electrostatic potential energy of the atoms is 
Equation: 


ee ALUN 2 
Oe ase eT Ve 
T0 0.236 nm 


The total energy difference associated with the formation of a NaCl formula unit 
is 
Equation: 


Ftorm = Ext, + Ucow + Uex = 1.52eV + (—6.10 eV) + 0.32 eV = —4.26 eV. 


Therefore, the dissociated energy of NaCl is 4.26 eV. 

Significance 

The formation of a NaCl formula unit by ionic bonding is energetically 
favorable. The dissociation energy, or energy required to separate the NaCl unit 
into Na‘ and Cl” ions is 4.26 eV, consistent with [link]. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Why is the potential energy associated with 
the exclusion principle positive in [link]? 


Solution: 


It corresponds to a repulsive force between core electrons in the ions. 


For a sodium ion in an ionic NaCl crystal, the expression for Coulomb potential 
energy U.ou must be modified by a factor known as the Madelung constant. 
This factor takes into account the interaction of the sodium ion with all nearby 
chloride and sodium ions. The Madelung constant for a NaCl crystal is about 
1.75. This value implies an equilibrium separation distance between 

Na‘ and Cl° ions of 0.280 nm—slightly larger than for diatomic NaCl. We will 
return to this point again later. 


Covalent bonds 


In an ionic bond, an electron transfers from one atom to another. However, in a 
covalent bond, an electron is shared between two atoms. The ionic bonding 
mechanism cannot explain the existence of such molecules as Hz, Og, and CO, 
since no separation distance exists for which the negative potential energy of 
attraction is greater in magnitude than the energy needed to create ions. 
Understanding precisely how such molecules are covalently bonded relies on a 
deeper understanding of quantum mechanics that goes beyond the coverage of 
this book, but we will qualitatively describe the mechanisms in the following 
section. 


Covalent bonds can be understood using the simple example of a Hj molecule, 


which consists of one electron in the electric field of two protons. This system can 
be modeled by an electron in a double square well ([{link]). The electron is equally 


likely to be found in each well, so the wave function is either symmetric or 
antisymmetric about a point midway between the wells. 


+ + + + 
(a) Symmetric (b) Antisymmetric 


A one-dimensional model of covalent bonding in a He 
molecule. (a) The symmetric wave function of the electron 
shared by the two positively charged protons (represented by 
the two finite square wells). (b) The corresponding 
antisymmetric wave function. 


Now imagine that the two wells are separated by a large distance. In the ground 
State, the wave function exists in one of two possible states: either a single 
positive peak (a sine wave-like “hump”) in both wells (symmetric case), or a 
positive peak in one well and a negative peak in the other (antisymmetric case). 
These states have the same energy. However, when the wells are brought together, 
the symmetric wave function becomes the ground state and the antisymmetric 
state becomes the first excited state—in other words, the energy level of the 
electron is split. Notice, the space-symmetric state becomes the energetically 
favorable (lower energy) state. 


The same analysis is appropriate for an electron bound to two hydrogen atoms. 
Here, the shapes of the ground-state wave functions have the form e 7/% or 
e(—||/20) in one dimension. The energetically favorable, space-symmetric state 
implies a high charge density midway between the protons where the electrons are 
likely to pull the positively charged protons together. 


If a second electron is added to this system to form a Hz molecule, the wave 
function must describe both particles, including their spatial relationship and 
relative spins. This wave function must also respect the indistinguishability of 
electrons. (“If you’ve seen one electron, you’ve seen them all.”) In particular, 
switching or exchanging the electrons should not produce an observable effect, a 
property called exchange symmetry. Exchange symmetry can be symmetric, 
producing no change in the wave function, or antisymmetric, producing an overall 
change in the sign of the wave function—neither of which is observable. 


As we discuss later, the total wave function of two electrons must be 
antisymmetric on exchange. For example, two electrons bound to a hydrogen 
molecule can be in a space-symmetric state with antiparallel spins (1) or space- 
antisymmetric state with parallel spins (t+). The state with antiparallel spins is 
energetically favorable and therefore used in covalent bonding. If the protons are 
drawn too closely together, however, repulsion between the protons becomes 
important. (In other molecules, this effect is supplied by the exclusion principle.) 
As aresult, Hy reaches an equilibrium separation of about 0.074 nm with a 
binding energy is 4.52 eV. 


Note: 
Visit this PBS Learning_Media tutorial and interactive simulation to explore the 
attractive and repulsive forces that act on atomic particles and covalent bonding 


in a Hy molecule. 


Quantum mechanics excludes many types of molecules. For example, the 
molecule H3 does not form, because if a third H atom approaches diatomic 
hydrogen, the wave function of the electron in this atom overlaps the electrons in 
the other two atoms. If all three electrons are in the ground states of their 
respective atoms, one pair of electrons shares all the same quantum numbers, 
which is forbidden by the exclusion principle. Instead, one of the electrons is 
forced into a higher energy state. No separation between three protons exists for 
which the total energy change of this process is negative—that is, where bonding 
occurs spontaneously. Similarly, He2 is not covalently bonded under normal 
conditions, because these atoms have no valence electrons to share. As the atoms 
are brought together, the wave functions of the core electrons overlap, and due to 
the exclusion principle, the electrons are forced into a higher energy state. No 
separation exists for which such a molecule is energetically favorable. 


Bonding in Polyatomic Molecules 


A polyatomic molecule is a molecule made of more than two atoms. Examples 
range from a simple water molecule to a complex protein molecule. The 
structures of these molecules can often be understood in terms of covalent 
bonding and hybridization. Hybridization is a change in the energy structure of 
an atom in which mixed states (states that can be written as a linear superposition 
of others) participate in bonding. 


To illustrate hybridization, consider the bonding in a simple water molecule, 
H,O. The electron configuration of oxygen is 1s?2s?2p*. The 1s and 2s 
electrons are in “closed shells” and do not participate in bonding. The remaining 
four electrons are the valence electrons. These electrons can fill six possible states 
(1 = 1,m = 0, +1, plus spin up and down). The energies of these states are the 
same, so the oxygen atom can exploit any linear combination of these states in 
bonding with the hydrogen atoms. These linear combinations (which you learned 
about in the chapter on atomic structure) are called atomic orbitals, and they are 
denoted by pz, py, and p,. The electron charge distributions for these orbitals are 
given in [link]. 


Py. Py p, 


Oxygen has four valence electrons. In the context of a water 
molecule, two valence electrons fill the p, orbital and one electron 
fills each of the pz and p, orbitals. The pz and p, orbitals are used 
in bonding with hydrogen atoms to form H2O. Without repulsion 
of H atoms, the bond angle between hydrogen atoms would be 90 

degrees. 


The transformation of the electron wave functions of oxygen to pz, py, and p, 
orbitals in the presence of the hydrogen atoms is an example of hybridization. 
Two electrons are found in the p, orbital with paired spins (7 |). One electron is 
found in each of the pz and p, orbitals, with unpaired spins. The latter orbitals 
participate in bonding with the hydrogen atoms. Based on [link], we expect the 
bonding angle for H—O—H to be 90°. However, if we include the effects of 
repulsion between atoms, the bond angle is 104.5°. The same arguments can be 
used to understand the tetrahedral shape of methane (CH,4) and other molecules. 


Summary 


e Molecules form by two main types of bonds: the ionic bond and the covalent 
bond. An ionic bond transfers an electron from one atom to another, and a 
covalent bond shares the electrons. 

e The energy change associated with ionic bonding depends on three main 
processes: the ionization of an electron from one atom, the acceptance of the 
electron by the second atom, and the Coulomb attraction of the resulting 
ions. 


¢ Covalent bonds involve space-symmetric wave functions. 
e Atoms use a linear combination of wave functions in bonding with other 
molecules (hybridization). 


Conceptual Questions 


Exercise: 
Problem: 


What is the main difference between an ionic bond, a covalent bond, and a 
van der Waals bond? 


Solution: 


An ionic bond is formed by the attraction of a positive and negative ion. A 
covalent bond is formed by the sharing of one or more electrons between 
atoms. A van der Waals bond is formed by the attraction of two electrically 
polarized molecules. 


Exercise: 
Problem: 
For the following cases, what type of bonding is expected? (a) KCl 
molecule; (b) Nz molecule. 


Exercise: 


Problem: Describe three steps to ionic bonding. 
Solution: 


1. An electron is removed from one atom. The resulting atom is a positive 
ion. 2. An electron is absorbed from another atom. The result atom is a 
negative ion. 3. The positive and negative ions are attracted together until an 
equilibrium separation is reached. 


Exercise: 


Problem: 


What prevents a positive and negative ion from having a zero separation? 


Exercise: 


Problem: 
For the Hz molecule, why must the spins the electron spins be antiparallel? 
Solution: 


Bonding is associated with a spatial function that is symmetric under 
exchange of the two electrons. In this state, the electron density is largest 
between the atoms. The total function must be antisymmetric (since electrons 
are fermions), so the spin function must be antisymmetric. In this state, the 
spins of the electrons are antiparallel. 


Problems 


Exercise: 
Problem: 
The electron configuration of carbon is 1s?2s?2p?. Given this electron 


configuration, what other element might exhibit the same type of 
hybridization as carbon? 


Exercise: 
Problem: 
Potassium chloride (KCl) is a molecule formed by an ionic bond. At 


equilibrium separation the atoms are r9 = 0.279 nm apart. Determine the 
electrostatic potential energy of the atoms. 


Solution: 


U == 5,16 eV 
Exercise: 
Problem: 
The electron affinity of Cl is 3.89 eV and the ionization energy of K is 4.34 


eV. Use the preceding problem to find the dissociation energy. (Neglect the 
energy of repulsion.) 


Exercise: 
Problem: 
The measured energy dissociated energy of KCl is 4.43 eV. Use the results of 


the preceding problem to determine the energy of repulsion of the ions due to 
the exclusion principle. 


Solution: 


—4.43 eV = —4.69 eV + Vex, Vex = 0.26 eV 


Glossary 


covalent bond 
bond formed by the sharing of one or more electrons between atoms 


dissociation energy 
amount of energy needed to break apart a molecule into atoms; also, total 
energy per ion pair to separate the crystal into isolated ions 


electron affinity 
energy associated with an accepted (bound) electron 


equilibrium separation distance 
distance between atoms in a molecule 


exchange symmetry 
how a total wave function changes under the exchange of two electrons 


hybridization 
change in the energy structure of an atom in which energetically favorable 
mixed states participate in bonding 


ionic bond 
bond formed by the Coulomb attraction of a positive and negative ions 


Madelung constant 
constant that depends on the geometry of a crystal used to determine the total 
potential energy of an ion in a crystal 


polyatomic molecule 
molecule formed of more than one atom 


van der Waals bond 
bond formed by the attraction of two electrically polarized molecules 


Molecular Spectra 
By the end of this section, you will be able to: 


e Use the concepts of vibrational and rotational energy to describe energy transitions in a 
diatomic molecule 

e Explain key features of a vibrational-rotational energy spectrum of a diatomic molecule 

e Estimate allowed energies of a rotating molecule 

¢ Determine the equilibrium separation distance between atoms in a diatomic molecule from 
the vibrational-rotational absorption spectrum 


Molecular energy levels are more complicated than atomic energy levels because molecules can 
also vibrate and rotate. The energies associated with such motions lie in different ranges and can 
therefore be studied separately. Electronic transitions are of order 1 eV, vibrational transitions are 
of order 10? eV, and rotational transitions are of order 10° eV. For complex molecules, these 
energy changes are difficult to characterize, so we begin with the simple case of a diatomic 
molecule. 


According to classical mechanics, the energy of rotation of a diatomic molecule is given by 
Equation: 


where I is the moment of inertia and L is the angular momentum. According to quantum 
mechanics, the rotational angular momentum is quantized: 
Equation: 


L=,/1(1+1)h(1=0, 1,2, 3....), 


where | is the orbital angular quantum number. The allowed rotational energy level of a 
diatomic molecule is therefore 


Note: 
Equation: 


2 
E, =1(1+ Ne =1(1+1)Eor (l=0,1,2,3....), 


where the characteristic rotational energy of a molecule is defined as 


Note: 
Equation: 


For a diatomic molecule, the moment of inertia with reduced mass yu is 


Note: 
Equation: 


are 


where 7 is the total distance between the atoms. The energy difference between rotational levels 
is therefore 
Equation: 


AE, = E41 — Ey = 2 (1 + 1) For. 


A detailed study of transitions between rotational energy levels brought about by the absorption 
or emission of radiation (a so-called electric dipole transition) requires that 


Note: 
Equation: 


b= 1, 


This rule, known as a selection rule, limits the possible transitions from one quantum state to 
another. [link] is the selection rule for rotational energy transitions. It applies only to diatomic 
molecules that have an electric dipole moment. For this reason, symmetric molecules such as H» 
and N» do not experience rotational energy transitions due to the absorption or emission of 
electromagnetic radiation. 


Example: 


The Rotational Energy of HCl 

Determine the lowest three rotational energy levels of a hydrogen chloride (HCl) molecule. 
Strategy 

Hydrogen chloride (HCl) is a diatomic molecule with an equilibrium separation distance of 
0.127 nm. Rotational energy levels depend only on the momentum of inertia J and the orbital 
angular momentum quantum number | (in this case, / = 0, 1, and 2). The momentum of inertia 
depends, in turn, on the equilibrium separation distance (which is given) and the reduced mass, 
which depends on the masses of the H and Cl atoms. 

Solution 

First, we compute the reduced mass. If Particle 1 is hydrogen and Particle 2 is chloride, we have 
Equation: 


1.0 u) (35.4 
ete Cane ee 2 =osra=o9re 
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The corresponding rest mass energy is therefore 
Equation: 


pc? = 9.06 x 10° eV. 


This allows us to calculate the characteristic energy: 
Equation: 


hn? hn? fic)? 197.3 eV - nm)” 
Ey == =—— = he) -= ayy ~ = 1.33 x 10-SeV. 
20 2(urs) 2 (ue?)rg =: 2 (9.06 x 108 eV) (0.127 nm) 


(Notice how this expression is written in terms of the rest mass energy. This technique is 
common in modern physics calculations.) The rotational energy levels are given by 
Equation: 


2 
B, = 10+ 1) =1(1+1)Eor, 


where | is the orbital quantum number. The three lowest rotational energy levels of an HCl 
molecule are therefore 


Equation: 
| = 0; E, = OeV (no rotation), 
Equation: 
l= 1;E, = 2 Eo, = 2.66 x 10-° eV, 
Equation: 


| = 2: EF, = 6 Eo, = 7.99 x 10? eV. 


Significance 


The rotational spectrum is associated with weak transitions (1/1000 to 1/100 of an eV). By 
comparison, the energy of an electron in the ground state of hydrogen is —13.6 eV. 


Note: 
Exercise: 


Problem: 


Check Your Understanding What does the energy separation between absorption lines in 
a rotational spectrum of a diatomic molecule tell you? 


Solution: 


the moment of inertia 


The vibrational energy level, which is the energy level associated with the vibrational energy of 
a molecule, is more difficult to estimate than the rotational energy level. However, we can 
estimate these levels by assuming that the two atoms in the diatomic molecule are connected by 
an ideal spring of spring constant k. The potential energy of this spring system is 

Equation: 


1 
Uose = yhAr”, 


Where Ar is a change in the “natural length” of the molecule along a line that connects the 
atoms. Solving Schrédinger’s equation for this potential gives 
Equation: 


1 
E, = (n+ >) tw (= 012 aes) 


Where w is the natural angular frequency of vibration and n is the vibrational quantum number. 
The prediction that vibrational energy levels are evenly spaced (AE = fw) turns out to be good 
at lower energies. 


A detailed study of transitions between vibrational energy levels induced by the absorption or 
emission of radiation (and the specifically so-called electric dipole transition) requires that 


Note: 
Equation: 


[link] represents the selection rule for vibrational energy transitions. As mentioned before, this 
rule applies only to diatomic molecules that have an electric dipole moment. Symmetric 
molecules do not experience such transitions. 


Due to the selection rules, the absorption or emission of radiation by a diatomic molecule 
involves a transition in vibrational and rotational states. Specifically, if the vibrational quantum 
number (n) changes by one unit, then the rotational quantum number (/) changes by one unit. An 
energy-level diagram of a possible transition is given in [link]. The absorption spectrum for such 
transitions in hydrogen chloride (HCI) is shown in [link]. The absorption peaks are due to 
transitions from the n = 0 to n = 1 vibrational states. Energy differences for the band of peaks 
at the left and right are, respectively, 

AF. 4141 = fiw + 2(14+ 1) For = hw + 2Eor, hw + 4Eo,, hw + 6Eo,,... (right band) and 
AF. 41-1 = fiw — 2Eop = hw — 2E or, hw — 4Eo,, hw — 6Eo,,... (left band). 


The moment of inertia can then be determined from the energy spacing between individual peaks 
(2Eo,) or from the gap between the left and right bands (4F,.). The frequency at the center of 
this gap is the frequency of vibration. 


W/ Excited electronic 
state 


Energy 


%/ Ground state 
Vibrational energy level 


Rotational level 


internuclear separation 


Three types of energy levels in a diatomic 
molecule: electronic, vibrational, and rotational. 
If the vibrational quantum number (n) changes by 
one unit, then the rotational quantum number (1) 
changes by one unit. 


Transitions where the vibrational energy Transitions where the vibrational energy 
increases (n = 0 —~ 1) increases (n = 0 —~ 1) 
and the rotational angular momentum and the rotational angular momentum 
decreases (j —> j — 1) increases (j —> j + 1) 


Intensity 


Center frequency 
forn=O—»n=1 
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Absorption spectrum of hydrogen chloride (HCl) from the n = 0 to n = 1 vibrational 
levels. The discrete peaks indicate a quantization of the angular momentum of the molecule. 
The bands to the left indicate a decrease in angular momentum, whereas those to the right 
indicate an increase in angular momentum. 


Summary 


e Molecules possess vibrational and rotational energy. 

e Energy differences between adjacent vibrational energy levels are larger than those between 
rotational energy levels. 

¢ Separation between peaks in an absorption spectrum is inversely related to the moment of 
inertia. 

¢ Transitions between vibrational and rotational energy levels follow selection rules. 


Conceptual Questions 


Exercise: 
Problem: 
Does the absorption spectrum of the diatomic molecule HCl depend on the isotope of 
chlorine contained in the molecule? Explain your reasoning. 


Exercise: 


Problem: 


Rank the energy spacing (AZ) of the following transitions from least to greatest: an 
electron energy transition in an atom (atomic energy), the rotational energy of a molecule, or 
the vibrational energy of a molecule? 


Solution: 


rotational energy, vibrational energy, and atomic energy 
Exercise: 


Problem: 


Explain key features of a vibrational-rotation energy spectrum of the diatomic molecule. 


Problems 


Exercise: 
Problem: 
In a physics lab, you measure the vibrational-rotational spectrum of HCI. The estimated 
separation between absorption peaks is Af + 5.5 x 10’ Hz. The central frequency of the 


band is fp = 9.0 x 10° Hz. (a) What is the moment of inertia (I)? (b) What is the energy 
of vibration for the molecule? 


Exercise: 
Problem: 


For the preceding problem, find the equilibrium separation of the H and Cl atoms. Compare 
this with the actual value. 


Solution: 
The measured value is 0.484 nm, and the actual value is close to 0.127 nm. The laboratory 
results are the same order of magnitude, but a factor 4 high. 
Exercise: 
Problem: 
The separation between oxygen atoms in an O, molecule is about 0.121 nm. Determine the 
characteristic energy of rotation in eV. 
Exercise: 
Problem: 


The characteristic energy of the Nz molecule is 2.48 x 10 * eV. Determine the separation 
distance between the nitrogen atoms 


Solution: 


0.110 nm 
Exercise: 
Problem: 
The characteristic energy for KCl is 1.4 x 10~° eV. (a) Determine yu for the KCI molecule. 
(b) Find the separation distance between the K and Cl atoms. 
Exercise: 
Problem: 


A diatomic F2 molecule is in the / = 1 state. (a) What is the energy of the molecule? (b) 
How much energy is radiated in a transition from al = 2 toal = 1 state? 


Solution: 


a. =2.2 x 10°-*eV:b. AF =4.4 x 10°-*eV 
Exercise: 
Problem: 
In a physics lab, you measure the vibrational-rotational spectrum of potassium bromide 
(KBr). The estimated separation between absorption peaks is Af © 5.35 x 10!° Hz. The 


central frequency of the band is fy = 8.75 x 101? Hz. (a) What is the moment of inertia 
(1)? (b) What is the energy of vibration for the molecule? 


Glossary 


electric dipole transition 
transition between energy levels brought by the absorption or emission of radiation 


rotational energy level 
energy level associated with the rotational energy of a molecule 


selection rule 
rule that limits the possible transitions from one quantum state to another 


vibrational energy level 
energy level associated with the vibrational energy of a molecule 


Bonding in Crystalline Solids 
By the end of this section, you will be able to: 


e Describe the packing structures of common solids 

e Explain the difference between bonding in a solid and in a molecule 

e Determine the equilibrium separation distance given crystal properties 
e Determine the dissociation energy of a salt given crystal properties 


Beginning in this section, we study crystalline solids, which consist of 
atoms arranged in an extended regular pattern called a lattice. Solids that do 
not or are unable to form crystals are classified as amorphous solids. 
Although amorphous solids (like glass) have a variety of interesting 
technological applications, the focus of this chapter will be on crystalline 
solids. 


Atoms arrange themselves in a lattice to form a crystal because of a net 
attractive force between their constituent electrons and atomic nuclei. The 
crystals formed by the bonding of atoms belong to one of three categories, 
classified by their bonding: ionic, covalent, and metallic. Molecules can 
also bond together to form crystals; these bonds, not discussed here, are 
classified as molecular. Early in the twentieth century, the atomic model of 
a solid was speculative. We now have direct evidence of atoms in solids 
({link]). 


An image made with a scanning tunneling microscope of the surface 
of graphite. The peaks represent the atoms, which are arranged in 
hexagons. The scale is in angstroms. 


Ionic Bonding in Solids 


Many solids form by ionic bonding. A prototypical example is the sodium 
chloride crystal, as we discussed earlier. Electrons transfer from sodium 
atoms to adjacent chlorine atoms, since the valence electrons in sodium are 
loosely bound and chlorine has a large electron affinity. The positively 
charged sodium ions and negatively charged chlorine (chloride) ions 
organize into an extended regular array of atoms ((Link)). 


Chloride ion 


Sodium ion 


Structure of the sodium chloride crystal. The sodium 
and chloride ions are arranged in a face-centered cubic 
(FCC) structure. 


The charge distributions of the sodium and chloride ions are spherically 
symmetric, and the chloride ion is about two times the diameter of the 
sodium ion. The lowest energy arrangement of these ions is called the face- 
centered cubic (FCC) structure. In this structure, each ion is closest to six 
ions of the other species. The unit cell is a cube—an atom occupies the 


center and corners of each “face” of the cube. The attractive potential 
energy of the Na ion due to the fields of these six Cl ions is written 
Equation: 


2 


Aner 


where the minus sign designates an attractive potential (and we identify 
k = 1/479). At a distance /2r are its next-nearest neighbors: twelve 
Na‘ ions of the same charge. The total repulsive potential energy 
associated with these ions is 

Equation: 


e2 


Uy, = 12 -—_—_. 
Arey 2r 


Next closest are eight Cl” ions a distance /3r from the Na* ion. The 
potential energy of the Na” ion in the field of these eight ions is 
Equation: 


2 
U3 = —8 ————_.. 
Anéy/3r 


Continuing in the same manner with alternate sets of Cl” and Na” ions, we 
find that the net attractive potential energy U, of the single Na* ion can be 
written as 
Equation: 


e2 


Ucout = 
Anmeor 


where a is the Madelung constant, introduced earlier. From this analysis, 
we can see that this constant is the infinite converging sum 


Equation: 


Distant ions make a significant contribution to this sum, so it converges 
slowly, and many terms must be used to calculate a accurately. For all FCC 
ionic solids, a is approximately 1.75. 


Other possible packing arrangements of atoms in solids include simple 
cubic and body-centered cubic (BCC). These three different packing 
structures of solids are compared in [link]. The first row represents the 
location, but not the size, of the ions; the second row indicates the unit cells 
of each structure or lattice; and the third row represents the location and 
size of the ions. The BCC structure has eight nearest neighbors, with a 
Madelung constant of about 1.76—only slightly different from that for the 
FCC structure. Determining the Madelung constant for specific solids is 
difficult work and the subject of current research. 


Locations of ions 
in unit cell 


Sizes of ions and parts 
alloted to each cell 


Unit cells within 
the overall lattice 


(a) Simple cubic (b) Body-centered cubic (c) Face-centered cubic 
) 


Packing structures for solids from left to right: (a) simple cubic, (b) 
body-centered cubic (BCC), and (c) face-centered cubic (FCC). Each 
crystal structure minimizes the energy of the system. 


The energy of the sodium ions is not entirely due to attractive forces 
between oppositely charged ions. If the ions are bought too close together, 
the wave functions of core electrons of the ions overlap, and the electrons 
repel due to the exclusion principle. The total potential energy of the Na* 
ion is therefore the sum of the attractive Coulomb potential (Uo) and the 
repulsive potential associated with the exclusion principle (U,,). 
Calculating this repulsive potential requires powerful computers. 


Fortunately, however, this energy can be described accurately by a simple 
formula that contains adjustable parameters: 


Note: 
Equation: 


where the parameters A and n are chosen to give predictions consistent with 
experimental data. For the problem at the end of this chapter, the parameter 
n is referred to as the repulsion constant. The total potential energy of the 
Na* ion is therefore 

Equation: 


5 A 
U = —a—— + —. 
4 Teor tad 


At equilibrium, there is no net force on the ion, so the distance between 
neighboring Na‘ and Cl” ions must be the value 79 for which U is a 
minimum. Setting —- = 0, we have 


Equation: 
> ae? nA 
Attegry? rortl 
Thus, 
Equation: 


Inserting this expression into the expression for the total potential energy, 
we have 
Equation: 


2 
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Notice that the total potential energy now has only one adjustable 
parameter, n. The parameter A has been replaced by a function involving ro, 
the equilibrium separation distance, which can be measured by a diffraction 
experiment (you learned about diffraction in a previous chapter). The total 
potential energy is plotted in [link] for n = 8, the approximate value of n 
for NaCl. 


Potential energy 


ro 


The potential energy of a sodium ion ina 

NaCl crystal for n = 8. The equilibrium 

bond length occurs when the energy is a 
minimized. 


As long as n > 1, the curve for U has the same general shape: U 
approaches infinity as r — 0 and U approaches zero as r — oo. The 
minimum value of the potential energy is given by 

Equation: 


The energy per ion pair needed to separate the crystal into ions is therefore 


Note: 
Equation: 


ke? il 
Ue ee pike (1 — =): 
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This is the dissociation energy of the solid. The dissociation energy can 
also be used to describe the total energy needed to break a mole of a solid 
into its constituent ions, often expressed in kJ/mole. The dissociation energy 
can be determined experimentally using the latent heat of vaporization. 
Sample values are given in the following table. 


F- Cl” Br- Te 
Lit 1036 853 807 757 
Nat 923 787 747 704 
Kt 821 715 682 649 


Rb* 785 689 660 630 


Cs? 740 659 631 604 


Lattice Energy for Alkali Metal Halides 


Thus, we can determine the Madelung constant from the crystal structure 
and n from the lattice energy. For NaCl, we have ro = 2.81 A, n & 8, and 
Uaiss = 7.84 eV /ion pair. This dissociation energy is relatively large. The 
most energetic photon from the visible spectrum, for example, has an 
energy of approximately 

Equation: 


hf = (4.14 x 10°" eV-s)(7.5 x 10'* Hz) = 3.1 eV. 


Because the ions in crystals are so tightly bound, ionic crystals have the 
following general characteristics: 


1. They are fairly hard and stable. 

2. They vaporize at relatively high temperatures (1000 to 2000 K). 

3. They are transparent to visible radiation, because photons in the visible 
portion of the spectrum are not energetic enough to excite an electron 
from its ground state to an excited state. 

4. They are poor electrical conductors, because they contain effectively 
no free electrons. 

5. They are usually soluble in water, because the water molecule has a 
large dipole moment whose electric field is strong enough to break the 
electrostatic bonds between the ions. 


Example: 

The Dissociation Energy of Salt 

Determine the dissociation energy of sodium chloride (NaCl) in kJ/mol. 
(Hint: The repulsion constant n of NaCl is approximately 8.) 

Strategy 

A sodium chloride crystal has an equilibrium separation of 0.282 nm. 
(Compare this value with 0.236 nm for a free diatomic unit of NaCl.) The 
dissociation energy depends on the separation distance, repulsion constant, 


and Madelung constant for an FCC structure. The separation distance 
depends in turn on the molar mass and measured density. We can 
determine the separation distance, and then use this value to determine the 
dissociation energy for one mole of the solid. 
Solution 
The atomic masses of Na and Cl are 23.0 u and 58.4 u, so the molar mass 
of NaCl is 58.4 g/mol. The density of NaCl is 2.16 g/ cm?. The 
relationship between these quantities is 
Equation: 

M M 
V 2N Ace 
where M is the mass of one mole of salt, Va is Avogadro’s number, and ro 
is the equilibrium separation distance. The factor 2 is needed since both the 
sodium and chloride ions represent a cubic volume ie Solving for the 
distance, we get 
Equation: 


M 58.4g¢/mol 


= ecm | 
2Nap 2 (6.03 x 107%) (2.160g/cm?) 


r9 


or 
Equation: 


To = 2.80 x 10-° cm = 0.280 nm. 


The potential energy of one ion pair (Na*Cl ) is 
Equation: 


where a is the Madelung constant, 79 is the equilibrium separation 
distance, and n is the repulsion constant. NaCl is FCC, so the Madelung 
constant is a = 1.7476. Substituting these values, we get 

Equation: 


er acta (1 =| ee! 


0.280 nm ion pair 


The dissociation energy of one mole of sodium chloride is therefore 
Equation: 


23.052 kcal 
D= (=| (= | = 182 kcal/mol = 760 kJ/mol. 


ion pair _leV 
ion pair 


Significance 

This theoretical value of the dissociation energy of 766 kJ/mol is close to 
the accepted experimental value of 787 kJ/mol. Notice that for larger 
density, the equilibrium separation distance between ion pairs is smaller, as 
expected. This small separation distance drives up the force between ions 
and therefore the dissociation energy. The conversion at the end of the 
equation took advantage of the conversion factor 1 kJ = 0.239 kcal. 


Note: 
Exercise: 


Problem: 


Check Your Understanding If the dissociation energy were larger, 
would that make it easier or more difficult to break the solid apart? 


Solution: 


more difficult 


Covalent Bonding in Solids 


Crystals can also be formed by covalent bonding. For example, covalent 
bonds are responsible for holding carbon atoms together in diamond 


crystals. The electron configuration of the carbon atom is 1s?2s?2p?—a He 
core plus four valence electrons. This electron configuration is four 
electrons short of a full shell, so by sharing these four electrons with other 
carbon atoms in a covalent bond, the shells of all carbon atoms are filled. 
Diamond has a more complicated structure than most ionic crystals ({link]). 
Each carbon atom is the center of a regular tetrahedron, and the angle 
between the bonds is 110°. This angle is a direct consequence of the 
directionality of the p orbitals of carbon atoms. 


Structure of the diamond crystal. (a) The single carbon atom 
represented by the dark blue sphere is covalently bonded to the four 
carbon atoms represented by the light blue spheres. (b) Gem-quality 
diamonds can be cleaved along smooth planes, which gives a large 
number of angles that cause total internal reflection of incident light, 

and thus gives diamonds their prized brilliance. 


Covalently bonded crystals are not as uniform as ionic crystals but are 
reasonably hard, difficult to melt, and are insoluble in water. For example, 
diamond has an extremely high melting temperature (4000 K) and is 
transparent to visible light. In comparison, covalently bonded tin (also 


known as alpha-tin, which is nonmetallic) is relatively soft, melts at 600 K, 
and reflects visible light. Two other important examples of covalently 
bonded crystals are silicon and germanium. Both of these solids are used 
extensively in the manufacture of diodes, transistors, and integrated circuits. 
We will return to these materials later in our discussion of semiconductors. 


Metallic Bonding in Solids 


As the name implies, metallic bonding is responsible for the formation of 
metallic crystals. The valence electrons are essentially free of the atoms and 
are able to move relatively easily throughout the metallic crystal. Bonding 
is due to the attractive forces between the positive ions and the conduction 
electrons. Metallic bonds are weaker than ionic or covalent bonds, with 
dissociation energies in the range 1 — 3eV. 


Summary 


e Packing structures of common ionic salts include FCC and BCC. 

e The density of a crystal is inversely related to the equilibrium constant. 

e The dissociation energy of a salt is large when the equilibrium 
separation distance is small. 

e The densities and equilibrium radii for common salts (FCC) are nearly 
the same. 


Conceptual Questions 


Exercise: 


Problem: 


Why is the equilibrium separation distance between K* and Cl™ 
different for a diatomic molecule than for solid KCI? 


Solution: 


Each ion is in the field of multiple ions of the other opposite charge. 


Exercise: 


Problem: 


Describe the difference between a face-centered cubic structure (FCC) 
and a body-centered cubic structure (BCC). 


Exercise: 


Problem: 


In sodium chloride, how many Cl atoms are “nearest neighbors” of 
Na‘? How many Na‘ atoms are “nearest neighbors” of Cl” ? 


Solution: 


6, 6 
Exercise: 


Problem: 


In cesium iodide, how many Cl” atoms are “nearest neighbors” of 
Cs‘? How many Cs” atoms are “nearest neighbors” of Cl” ? 


Exercise: 


Problem: 


The NaCl crystal structure is FCC. The equilibrium spacing is 
ro = 0.282 nm. If each ion occupies a cubic volume of . estimate 
the distance between “nearest neighbor” Na" ions (center-to-center)? 


Solution: 


0.399 nm 


Problems 


Exercise: 


Problem: 


The CsI crystal structure is BCC. The equilibrium spacing is 
approximately ry = 0.46 nm. If Cs* ion occupies a cubic volume of 
ee what is the distance of this ion to its “nearest neighbor” I* ion? 


Solution: 


0.65 nm 
Exercise: 
Problem: 
The potential energy of a crystal is —8.10 eV/ion pair. Find the 
dissociation energy for four moles of the crystal. 
Exercise: 


Problem: 


The measured density of a NaF crystal is 2.558 g/ cm. What is the 
equilibrium separate distance of Na* and Fl” ions? 


Solution: 


ro = 0.240 nm 
Exercise: 
Problem: 
What value of the repulsion constant, n, gives the measured 
dissociation energy of 221 kcal/mole for NaF? 
Exercise: 
Problem: 


Determine the dissociation energy of 12 moles of sodium chloride 
(NaCl). (Hint: the repulsion constant n is approximately 8.) 


Solution: 


2196 kcal 
Exercise: 
Problem: 
The measured density of a KCI crystal is 1.984 g/ cm. What is the 
equilibrium separation distance of K* and Cl” ions? 
Exercise: 


Problem: 


What value of the repulsion constant, n, gives the measured 
dissociation energy of 171 kcal/mol for KCl? 


Solution: 


11:5 
Exercise: 
Problem: 


The measured density of a CsCl crystal is 3.988 g/ cm?®. What is the 
equilibrium separate distance of Cs‘ and Cl” ions? 


Glossary 


body-centered cubic (BCC) 
crystal structure in which an ion is surrounded by eight nearest 
neighbors located at the corners of a unit cell 


face-centered cubic (FCC) 
crystal structure in which an ion is surrounded by six nearest neighbors 
located at the faces at the faces of a unit cell 


lattice 


regular array or arrangement of atoms into a crystal structure 


repulsion constant 
experimental parameter associated with a repulsive force between ions 
brought so close together that the exclusion principle is important 


simple cubic 
basic crystal structure in which each ion is located at the nodes of a 
three-dimensional grid 


Free Electron Model of Metals 
By the end of this section, you will be able to: 


e Describe the classical free electron model of metals in terms of the concept 
electron number density 

e Explain the quantum free-electron model of metals in terms of Pauli’s 
exclusion principle 

¢ Calculate the energy levels and energy-level spacing of a free electron in a 
metal 


Metals, such as copper and aluminum, are held together by bonds that are very 
different from those of molecules. Rather than sharing and exchanging 
electrons, a metal is essentially held together by a system of free electrons that 
wander throughout the solid. The simplest model of a metal is the free electron 
model. This model views electrons as a gas. We first consider the simple one- 
dimensional case in which electrons move freely along a line, such as through a 
very thin metal rod. The potential function U(x) for this case is a one- 
dimensional infinite square well where the walls of the well correspond to the 
edges of the rod. This model ignores the interactions between the electrons but 
respects the exclusion principle. For the special case of T’ = 0 K, N electrons 
fill up the energy levels, from lowest to highest, two at a time (spin up and spin 
down), until the highest energy level is filled. The highest energy filled is called 
the Fermi energy. 


The one-dimensional free electron model can be improved by considering the 
three-dimensional case: electrons moving freely in a three-dimensional metal 
block. This system is modeled by a three-dimensional infinite square well. 
Determining the allowed energy states requires us to solve the time-independent 
Schrédinger equation 

Equation: 


us 6g? 02 02 
fae us Oy? 23 oz )oe z) a Ey(2z,y, z), 


where we assume that the potential energy inside the box is zero and infinity 
otherwise. The allowed wave functions describing the electron’s quantum states 
can be written as 

Equation: 


2° » NgeNL 
9d) — as 


2. Nyry 2... enz 
— sin sin 
Ly Lis Ly, Ly 
where nz, ,, and n, are positive integers representing quantum numbers 
corresponding to the motion in the x-, y-, and z-directions, respectively, and 
L,,L,, and L, are the dimensions of the box in those directions. [link] is 


simply the product of three one-dimensional wave functions. The allowed 
energies of an electron in a cube (L = L, = L, = L,) are 


Note: 
Equation: 


nh? 


Associated with each set of quantum numbers (nz, n,,,) are two quantum 
states, spin up and spin down. In a real material, the number of filled states is 
enormous. For example, in a cubic centimeter of metal, this number is on the 
order of 1072. Counting how many particles are in which state is difficult work, 
which often requires the help of a powerful computer. The effort is worthwhile, 
however, because this information is often an effective way to check the model. 


Example: 

Energy of a Metal Cube 

Consider a solid metal cube of edge length 2.0 cm. (a) What is the lowest 
energy level for an electron within the metal? (b) What is the spacing between 
this level and the next energy level? 

Strategy 

An electron in a metal can be modeled as a wave. The lowest energy 
corresponds to the largest wavelength and smallest quantum number: 

Nz, Ny, nN, = (1, 1,1). [link] supplies this “ground state” energy value. Since 


the energy of the electron increases with the quantum number, the next highest 
level involves the smallest increase in the quantum numbers, or 

(Me, My, Mz) = (2,1,1), (1,2, 1), or (1, 1, 2). 

Solution 

The lowest energy level corresponds to the quantum numbers 

Ne = Ny = N, = 1. From [link], the energy of this level is 

Equation: 


B(1,1,1) = 45 (+1°+1’) 


_ 3n2 (1.05 x 10-34 J-s)? 
2 (9.11 x 10-*! kg) (2.00 x 10°? m)” 


—4.48 x 10°* J = 2.80 x 107 eV. 


The next-higher energy level is reached by increasing any one of the three 
quantum numbers by 1. Hence, there are actually three quantum states with the 
same energy. Suppose we increase n, by 1. Then the energy becomes 
Equation: 


B(2,1,1) = 237 (2? +141) 


_ 6n2(1.05 x 10—34 J-s)” 
2(9.11 x 1078! kg)(2.00 x 107? m)” 


— 8.96 x 10°* J =5.60 x 10° eV. 


The energy spacing between the lowest energy state and the next-highest 
energy state is therefore 
Equation: 


(216 = Ei) 2.80 < 10s eV. 


Significance 

This is a very small energy difference. Compare this value to the average 
kinetic energy of a particle, kgT’, where kp is Boltzmann’s constant and T is 
the temperature. The product kgT is about 1000 times greater than the energy 
Spacing. 


Note: 


Exercise: 


Problem: 


Check Your Understanding What happens to the ground state energy of 
an electron if the dimensions of the solid increase? 


Solution: 


It decreases. 


Often, we are not interested in the total number of particles in all states, but 
rather the number of particles dN with energies in a narrow energy interval. This 
value can be expressed by 

Equation: 


dN =n(E)dE = 9(E)dE-F 


where n(E) is the electron number density, or the number of electrons per unit 
volume; g(E) is the density of states, or the number of allowed quantum states 
per unit energy; dE is the size of the energy interval; and F is the Fermi factor. 
The Fermi factor is the probability that the state will be filled. For example, if 
g(E)dE is 100 available states, but F is only 5 %, then the number of particles in 
this narrow energy interval is only five. Finding g(E) requires solving 
Schrédinger’s equation (in three dimensions) for the allowed energy levels. The 
calculation is involved even for a crude model, but the result is simple: 


Note: 
Equation: 


where V is the volume of the solid, ™m, is the mass of the electron, and EF is the 
energy of the state. Notice that the density of states increases with the square 
root of the energy. More states are available at high energy than at low energy. 
This expression does not provide information of the density of the electrons in 
physical space, but rather the density of energy levels in “energy space.” For 
example, in our study of the atomic structure, we learned that the energy levels 
of a hydrogen atom are much more widely spaced for small energy values (near 
than ground state) than for larger values. 


This equation tells us how many electron states are available in a three- 
dimensional metallic solid. However, it does not tell us how likely these states 
will be filled. Thus, we need to determine the Fermi factor, F’. Consider the 
simple case of J’ = 0 K. From classical physics, we expect that all the electrons 
(~ 107? / cm?) would simply go into the ground state to achieve the lowest 
possible energy. However, this violates Pauli’s exclusion principle, which states 
that no two electrons can be in the same quantum state. Hence, when we begin 
filling the states with electrons, the states with lowest energy become occupied 
first, then states with progressively higher energies. The last electron we put in 
has the highest energy. This energy is the Fermi energy Fp of the free electron 
gas. A state with energy EF < Ep is occupied by a single electron, and a state 
with energy & > Ep is unoccupied. To describe this in terms of a probability 
F(E) that a state of energy E is occupied, we write for 7’ = 0 K: 

Equation: 


The density of states, Fermi factor, and electron number density are plotted 


against energy in [link]. 
g(E) fE) g(ENAE) 


| g(E-) 


g(E) « EV 


(a) (b) 


(a) Density of states for a free electron gas; (b) probability that a state is 
occupied at JT’ = 0 K; (c) density of occupied states at T = 0 K. 


A few notes are in order. First, the electron number density (last row) 
distribution drops off sharply at the Fermi energy. According to the theory, this 
energy is given by 


Note: 
Equation: 


Fermi energies for selected materials are listed in the following table. 


Conduction Band Electron Free-Electron Model 
Element Density (107° m~*) Fermi Energy (eV) 
Al 18.1 11.7 
Ba 3.15 3.64 
Cu 8.47 7.00 
Au 5.90 5.53 
Fe 17.0 11.1 
Ag 5.86 5.49 


Conduction Electron Densities and Fermi Energies for Some Metals 


Note also that only the graph in part (c) of the figure, which answers the 
question, “How many particles are found in the energy range?” is checked by 
experiment. The Fermi temperature or effective “temperature” of an electron 
at the Fermi energy is 


Note: 
Equation: 
Ey 
G nen em 
F ie 
Example: 


Fermi Energy of Silver 

Metallic silver is an excellent conductor. It has 5.86 x 107° conduction 
electrons per cubic meter. (a) Calculate its Fermi energy. (b) Compare this 
energy to the thermal energy kpT of the electrons at a room temperature of 300 
IK, 

Solution 


a. From [link], the Fermi energy is 
Equation: 


Er = J” (372n,)7/? 


2Me 


(1.05 x 10-84 J-s)? ; oe 
= 3011x10" ke) * [(8m-(5.86 x 10°° m2) 


= 8.79 x 10° J =5.49 eV. 


2/3 


This is a typical value of the Fermi energy for metals, as can be seen from 
[link]. 

b. We can associate a Fermi temperature 7 with the Fermi energy by 
writing kp 7p = Ep. We then find for the Fermi temperature 
Equation: 


8.79 x 107-19 J 


= a = 
3G. 10-2 Jk 


F 


which is much higher than room temperature and also the typical melting 
point (~ 10° K) of a metal. The ratio of the Fermi energy of silver to the 
room-temperature thermal energy is 

Equation: 


To visualize how the quantum states are filled, we might imagine pouring water 
slowly into a glass, such as that of [link]. The first drops of water (the electrons) 
occupy the bottom of the glass (the states with lowest energy). As the level 
rises, states of higher and higher energy are occupied. Furthermore, since the 
glass has a wide opening and a narrow stem, more water occupies the top of the 
glass than the bottom. This reflects the fact that the density of states g(E) is 
proportional to E'/2, so there is a relatively large number of higher energy 
electrons in a free electron gas. Finally, the level to which the glass is filled 
corresponds to the Fermi energy. 


Fermi energy E- 


Electron gas 


An analogy of how electrons fill energy states in a 
metal. As electrons fill energy states, lowest to highest, 
the number of available states increases. The highest 
energy state (corresponding to the water line) is the 
Fermi energy. (credit: modification of work by 
“Didriks”/Flickr) 


Suppose that at J’ = 0 K, the number of conduction electrons per unit volume 
in our sample is n,. Since each field state has one electron, the number of filled 
States per unit volume is the same as the number of electrons per unit volume. 


Summary 


e Metals conduct electricity, and electricity is composed of large numbers of 
randomly colliding and approximately free electrons. 

e The allowed energy states of an electron are quantized. This quantization 
appears in the form of very large electron energies, even at T = 0 K. 


¢ The allowed energies of free electrons in a metal depend on electron mass 
and on the electron number density of the metal. 

e The density of states of an electron in a metal increases with energy, 
because there are more ways for an electron to fill a high-energy state than 
a low-energy state. 

e Pauli’s exclusion principle states that only two electrons (spin up and spin 
down) can occupy the same energy level. Therefore, in filling these energy 
levels (lowest to highest at T = 0 K), the last and largest energy level to 
be occupied is called the Fermi energy. 


Conceptual Questions 


Exercise: 
Problem: 
Why does the Fermi energy (Ff) increase with the number of electrons in 
a metal? 
Exercise: 
Problem: 


If the electron number density (N/V) of a metal increases by a factor 8, 
what happens to the Fermi energy (Fp)? 


Solution: 


increases by a factor of 82 =4 
Exercise: 


Problem: 


Why does the horizontal line in the graph in [link] suddenly stop at the 
Fermi energy? 


Exercise: 


Problem: Why does the graph in [link] increase gradually from the origin? 


Solution: 


For larger energies, the number of accessible states increases. 
Exercise: 
Problem: 


Why are the sharp transitions at the Fermi energy “smoothed out” by 
increasing the temperature? 


Problems 


Exercise: 


Problem: 


What is the difference in energy between the nz = ny = n, = 4 state and 
the state with the next higher energy? What is the percentage change in the 
energy between then, = n, = nz = 4 State and the state with the next 
higher energy? (b) Compare these with the difference in energy and the 
percentage change in the energy between the nz = ny = n, = 400 state 
and the state with the next higher energy. 


Solution: 


a.4%;b.4.2 x 10°*%; for very large values of the quantum numbers, 
the spacing between adjacent energy levels is very small (“in the 
continuum”). This is consistent with the expectation that for large quantum 
numbers, quantum and classical mechanics give approximately the same 
predictions. 


Exercise: 
Problem: 
An electron is confined to a metal cube of ! = 0.8 cm on each side. 


Determine the density of states at (a) # = 0.80 eV; (b) & = 2.2 eV; and 
(c) EH = 5.0 eV. 


Exercise: 


Problem: 


What value of energy corresponds to a density of states of 
L107: 1077 EN. 2 


Solution: 


10.0:eV 


Exercise: 


Problem: Compare the density of states at 2.5 eV and 0.25 eV. 
Exercise: 
Problem: 
Consider a cube of copper with edges 1.50 mm long. Estimate the number 


of electron quantum states in this cube whose energies are in the range 
3.75 to 3.77 eV. 


Solution: 


4.55 x 10° 
Exercise: 


Problem: 


If there is one free electron per atom of copper, what is the electron 
number density of this metal? 


Exercise: 


Problem: 
Determine the Fermi energy and temperature for copper at 7’ = 0 K. 
Solution: 


Fermi energy, fe = 7.03 eV, Temperature, 7p = 8.2 x 104K 


Glossary 


density of states 
number of allowed quantum states per unit energy 


electron number density 
number of electrons per unit volume 


Fermi energy 
largest energy filled by electrons in a metal at 7’ = 0K 


Fermi factor 
number that expresses the probability that a state of given energy will be 
filled 


Fermi temperature 
effective temperature of electrons with energies equal to the Fermi energy 


free electron model 
model of a metal that views electrons as a gas 


Band Theory of Solids 
By the end of this section, you will be able to: 


e Describe two main approaches to determining the energy levels of an 
electron in a crystal 

e Explain the presence of energy bands and gaps in the energy structure 
of a crystal 

e Explain why some materials are good conductors and others are good 
insulators 

e Differentiate between an insulator and a semiconductor 


The free electron model explains many important properties of conductors 
but is weak in at least two areas. First, it assumes a constant potential 
energy within the solid. (Recall that a constant potential energy is 
associated with no forces.) [link] compares the assumption of a constant 
potential energy (dotted line) with the periodic Coulomb potential, which 
drops as —1/r at each lattice point, where r is the distance from the ion 
core (solid line). Second, the free electron model assumes an impenetrable 
barrier at the surface. This assumption is not valid, because under certain 
conditions, electrons can escape the surface—such as in the photoelectric 
effect. In addition to these assumptions, the free electron model does not 
explain the dramatic differences in electronic properties of conductors, 
semiconductors, and insulators. Therefore, a more complete model is 
needed. 


1) Approximate constant 
(7) potential energy 
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The periodic potential used to model electrons in a 
conductor. Each ion in the solid is the source of a Coulomb 
potential. Notice that the free electron model is productive 
because the average of this field is approximately constant. 


We can produce an improved model by solving Schrédinger’s equation for 
the periodic potential shown in [link]. However, the solution requires 
technical mathematics far beyond our scope. We again seek a qualitative 
argument based on quantum mechanics to find a way forward. 


We first review the argument used to explain the energy structure of a 
covalent bond. Consider two identical hydrogen atoms so far apart that 
there is no interaction whatsoever between them. Further suppose that the 
electron in each atom is in the same ground state: a 1s electron with an 
energy of —13.6 eV (ignore spin). When the hydrogen atoms are brought 


closer together, the individual wave functions of the electrons overlap and, 
by the exclusion principle, can no longer be in the same quantum state, 
which splits the original equivalent energy levels into two different energy 
levels. The energies of these levels depend on the interatomic distance, a 
([link]). 


If four hydrogen atoms are brought together, four levels are formed from 
the four possible symmetries—a single sine wave “hump” in each well, 
alternating up and down, and so on. In the limit of a very large number N of 
atoms, we expect a spread of nearly continuous bands of electronic energy 
levels in a solid (see [link](c)). Each of these bands is known as an energy 
band. (The allowed states of energy and wave number are still technically 
quantized, but for large numbers of atoms, these states are so close together 
that they are consider to be continuous or “in the continuum.”) 


Energy bands differ in the number of electrons they hold. In the 1s and 2s 
energy bands, each energy level holds up to two electrons (spin up and spin 
down), so this band has a maximum occupancy of 2N electrons. In the 2p 
energy band, each energy level holds up to six electrons, so this band has a 


maximum occupancy of 6N electrons ([link]). 
Ea E 


(a) (b) (c) 


The dependence of energy-level splitting on the average distance 
between (a) two atoms, (b) four atoms, and (c) a large number of 
atoms. For a large number of electrons, a continuous band of energies 
is produced. 


Energy 


es - 


A simple representation of the energy structure of a 
solid. Electrons belong to energy bands separated by 


energy gaps. 


Each energy band is separated from the other by an energy gap. The 
electrical properties of conductors and insulators can be understood in terms 
of energy bands and gaps. The highest energy band that is filled is known as 
a valence band. The next available band in the energy structure is known as 
a conduction band. In a conductor, the highest energy band that contains 
electrons is partially filled, whereas in an insulator, the highest energy band 
containing electrons is completely filled. The difference between a 
conductor and insulator is illustrated in [link]. 


A conductor differs from an insulator in how its electrons respond to an 
applied electric field. If a significant number of electrons are set into motion 
by the field, the material is a conductor. In terms of the band model, 


electrons in the partially filled conduction band gain kinetic energy from the 
electric field by filling higher energy states in the conduction band. By 
contrast, in an insulator, electrons belong to completely filled bands. When 
the field is applied, the electrons cannot make such transitions (acquire 
kinetic energy from the electric field) due to the exclusion principle. As a 
result, the material does not conduct electricity. 


Conduction band 


=— Energy gap 


Valence band 


Conductor: Insulator: 
Valence band filled, Valence band filled, 
conduction band unfilled conduction band filled 


(a) (b) 


Comparison of a conductor and insulator. The highest energy band is 
partially filled in a conductor but completely filled in an insulator. 


Note: 

Visit this simulation to learn about the origin of energy bands in crystals of 
atoms and how the structure of bands determines how a material conducts 
electricity. Explore how band structure creates a lattice of many wells. 


A semiconductor has a similar energy structure to an insulator except it has 
a relatively small energy gap between the lowest completely filled band and 
the next available unfilled band. This type of material forms the basis of 

modern electronics. At 7’ = 0 K, the semiconductor and insulator both have 


completely filled bands. The only difference is in the size of the energy gap 
(or band gap) E, between the highest energy band that is filled (the valence 
band) and the next-higher empty band (the conduction band). In a 
semiconductor, this gap is small enough that a substantial number of 
electrons from the valence band are thermally excited into the conduction 
band at room temperature. These electrons are then in a nearly empty band 
and can respond to an applied field. As a general rule of thumb, the band 
gap of a semiconductor is about 1 eV. (See [link] for silicon.) A band gap of 
greater than approximately 1 eV is considered an insulator. For comparison, 
the energy gap of diamond (an insulator) is several electron-volts. 


Material Energy Gap EF, (eV) 
Si 1.14 
Ge 0.67 
GaAs 1.43 
GaP 2.26 
GaSb 0.69 
InAs 0.35 
InP 1.35 
InSb 0.16 
C (diamond) 5.48 


Energy Gap for Various Materials at 300 KNote: Except for diamond, the 


materials licted are all camiceanductare 
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Summary 


e The energy levels of an electron in a crystal can be determined by 
solving Schrédinger’s equation for a periodic potential and by studying 
changes to the electron energy structure as atoms are pushed together 
from a distance. 

e The energy structure of a crystal is characterized by continuous energy 
bands and energy gaps. 

e The ability of a solid to conduct electricity relies on the energy 
structure of the solid. 


Conceptual Questions 


Exercise: 
Problem: 


What are the two main approaches used to determine the energy levels 
of electrons in a crystal? 


Solution: 


(1) Solve Schrédinger’s equation for the allowed states and energies. 
(2) Determine energy levels for the case of a very large lattice spacing 
and then determine the energy levels as this spacing is reduced. 


Exercise: 


Problem: 


Describe two features of energy levels for an electron in a crystal. 
Exercise: 
Problem: 


How does the number of energy levels in a band correspond to the 
number, N, of atoms. 


Solution: 


For N atoms spaced far apart, there are N different wave functions, all 
with the same energy (similar to the case of an electron in the double 
well of H2). As the atoms are pushed together, the energies of these N 
different wave functions are split. By the exclusion principle, each 
electron must each have a unique set of quantum numbers, so the N 
atoms bringing N electrons together must have at least N states. 


Exercise: 
Problem: 
Why are some materials very good conductors and others very poor 
conductors? 


Exercise: 


Problem: Why are some materials semiconductors? 
Solution: 


For a semiconductor, there is a relatively large energy gap between the 
lowest completely filled band and the next available unfilled band. 
Typically, a number of electrons traverse the gap and therefore the 
electrical conductivity is small. The properties of a semiconductor are 
sensitivity to temperature: As the temperature is increased, thermal 
excitations promote charge carriers from the valence band across the 
gap and into the conduction band. 


Exercise: 


Problem: 


Why does the resistance of a semiconductor decrease as the 
temperature increases? 


Problems 


Exercise: 
Problem: 
For a one-dimensional crystal, write the lattice spacing (a) in terms of 
the electron wavelength. 
Exercise: 
Problem: 


What is the main difference between an insulator and a 
semiconductor? 


Solution: 
For an insulator, the energy gap between the valence band and the 
conduction band is larger than for a semiconductor. 
Exercise: 
Problem: 
What is the longest wavelength for a photon that can excite a valence 
electron into the conduction band across an energy gap of 0.80 eV? 
Exercise: 
Problem: 
A valence electron in a crystal absorbs a photon of wavelength, 
A = 0.300 nm. This is just enough energy to allow the electron to 


jump from the valence band to the conduction band. What is the size 
of the energy gap? 


Solution: 


4.13 keV 


Glossary 


conduction band 
above the valence band, the next available band in the energy structure 
of a crystal 


energy band 
nearly continuous band of electronic energy levels in a solid 


energy gap 
gap between energy bands in a solid 


semiconductor 
solid with a relatively small energy gap between the lowest completely 
filled band and the next available unfilled band 


valence band 
highest energy band that is filled in the energy structure of a crystal 


Semiconductors and Doping 
By the end of this section, you will be able to: 


e Describe changes to the energy structure of a semiconductor due to 
doping 

e Distinguish between an n-type and p-type semiconductor 

e Describe the Hall effect and explain its significance 

¢ Calculate the charge, drift velocity, and charge carrier number density 
of a semiconductor using information from a Hall effect experiment 


In the preceding section, we considered only the contribution to the electric 
current due to electrons occupying states in the conduction band. However, 
moving an electron from the valence band to the conduction band leaves an 
unoccupied state or hole in the energy structure of the valence band, which 
a nearby electron can move into. As these holes are filled by other 
electrons, new holes are created. The electric current associated with this 
filling can be viewed as the collective motion of many negatively charged 
electrons or the motion of the positively charged electron holes. 


To illustrate, consider the one-dimensional lattice in [link]. Assume that 
each lattice atom contributes one valence electron to the current. As the 
hole on the right is filled, this hole moves to the left. The current can be 
interpreted as the flow of positive charge to the left. The density of holes, or 
the number of holes per unit volume, is represented by p. Each electron that 
transitions into the conduction band leaves behind a hole. If the conduction 
band is originally empty, the conduction electron density p is equal to the 
hole density, that is, n = p. 
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Flow of positive charge 


The motion of holes in a crystal lattice. 
As electrons shift to the right, an 
electron hole moves to the left. 


As mentioned, a semiconductor is a material with a filled valence band, an 
unfilled conduction band, and a relatively small energy gap between the 
bands. Excess electrons or holes can be introduced into the material by the 
substitution into the crystal lattice of an impurity atom, which is an atom 
of a slightly different valence number. This process is known as doping. 
For example, suppose we add an arsenic atom to a crystal of silicon ({link] 


(a)). 


-) = Silicon atoms 


*) = Arsenic atom 


@ = Extra electron from 
arsenic atom 


*) = Aluminum atom 


O =Hole 


(a) (b) 


(a) A donor impurity and (b) an acceptor impurity. The introduction to 


impurities and acceptors into a semiconductor significantly changes 
the electronic properties of this material. 


Arsenic has five valence electrons, whereas silicon has only four. This extra 
electron must therefore go into the conduction band, since there is no room 
in the valence band. The arsenic ion left behind has a net positive charge 
that weakly binds the delocalized electron. The binding is weak because the 
surrounding atomic lattice shields the ion’s electric field. As a result, the 
binding energy of the extra electron is only about 0.02 eV. In other words, 
the energy level of the impurity electron is in the band gap below the 
conduction band by 0.02 eV, a much smaller value than the energy of the 
gap, 1.14 eV. At room temperature, this impurity electron is easily excited 
into the conduction band and therefore contributes to the conductivity 
({link](a)). An impurity with an extra electron is known as a donor 
impurity, and the doped semiconductor is called an n-type semiconductor 
because the pamaly carriers of charge iereeons) ae negative. 


Conduction band ee@ @ Conduction band 
a | ————— Impurity band 
E, 
(a) (b) 


(a) The extra electron from a donor impurity is excited into the 
conduction band; (b) formation of an impurity band in an n-type 
semiconductor. 


By adding more donor impurities, we can create an impurity band, a new 
energy band created by semiconductor doping, as shown in [link](b). The 
Fermi level is now between this band and the conduction band. At room 
temperature, many impurity electrons are thermally excited into the 
conduction band and contribute to the conductivity. Conduction can then 
also occur in the impurity band as vacancies are created there. Note that 


changes in the energy of an electron correspond to a change in the motion 
(velocities or kinetic energy) of these charge carriers with the 
semiconductor, but not the bulk motion of the semiconductor itself. 


Doping can also be accomplished using impurity atoms that typically have 
one fewer valence electron than the semiconductor atoms. For example, Al, 
which has three valence electrons, can be substituted for Si, as shown in 
[link](b). Such an impurity is known as an acceptor impurity, and the 
doped semiconductor is called a p-type semiconductor, because the 
primary carriers of charge (holes) are positive. If a hole is treated as a 
positive particle weakly bound to the impurity site, then an empty electron 
State is created in the band gap just above the valence band. When this state 
is filled by an electron thermally excited from the valence band ([link](a)), a 
mobile hole is created in the valence band. By adding more acceptor 


impurities, we can create an impurity band, as shown in [link](b). 
Conduction band Conduction band 


= a ee eS Impurity band 


(a) (b) 


(a) An electron from the conduction band is excited into the empty 
state resulting from the acceptor impurity; (b) formation of an impurity 
band in a p-type semiconductor. 


The electric current of a doped semiconductor can be due to the motion of a 
majority carrier, in which holes are contributed by an impurity atom, or 
due to a minority carrier, in which holes are contributed purely by thermal 
excitations of electrons across the energy gap. In an n-type semiconductor, 
majority carriers are free electrons contributed by impurity atoms, and 
minority carriers are free electrons produced by thermal excitations from 
the valence to the conduction band. In a p-type semiconductor, the majority 
carriers are free holes contributed by impurity atoms, and minority carriers 
are free holes left by the filling of states due to thermal excitation of 


electrons across the gap. In general, the number of majority carriers far 
exceeds the minority carriers. The concept of a majority and minority 
carriers will be used in the next section to explain the operation of diodes 
and transistors. 


In studying p- and n-type doping, it is natural to ask: Do “electron holes” 
really act like particles? The existence of holes in a doped p-type 
semiconductor is demonstrated by the Hall effect. The Hall effect is the 
production of a potential difference due to the motion of a conductor 
through an external magnetic field (see The Hall Effect). A schematic of the 
Hall effect is shown in [link ](a). A semiconductor strip is bathed in a 
uniform magnetic field (which points into the paper). As the electron holes 
move from left to right through the semiconductor, a Lorentz force drives 
these charges toward the upper end of the strip. (Recall that the motion of 
the positively charged carriers is determined by the right-hand rule.) 
Positive charge continues to collect on the upper edge of the strip until the 
force associated with the downward electric field between the upper and 
lower edges of the strip (Fg = Eq) just balances the upward magnetic 
force (F'g = qvB). Setting these forces equal to each other, we have 

E = vB. The voltage that develops across the strip is therefore 


Note: 
Equation: 


Va = vBu, 


where Vy is the Hall voltage; v is the hole’s drift velocity, or average 
velocity of a particle that moves in a partially random fashion; B is the 
magnetic field strength; and w is the width of the strip. Note that the Hall 
voltage is transverse to the voltage that initially produces current through 
the material. A measurement of the sign of this voltage (or potential 
difference) confirms the collection of holes on the top side of the strip. The 


magnitude of the Hall voltage yields the drift velocity (v) of the majority 
carriers. 


Additional information can also be extracted from the Hall voltage. Note 
that the electron current density (the amount of current per unit cross- 
sectional area of the semiconductor strip) is 

Equation: 


j= nqu, 


where q is the magnitude of the charge, n is the number of charge carriers 
per unit volume, and v is the drift velocity. The current density is easily 
determined by dividing the total current by the cross-sectional area of the 
strip, q is charge of the hole (the magnitude of the charge of a single 
electron), and u is determined by the Hall effect [link]. Hence, the above 
expression for the electron current density gives the number of charge 
carriers per unit volume, n. A similar analysis can be conducted for 
negatively charged carriers in an n-type material (see [link]). 
| 
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(a) (b) 


The Hall effect. (a) Positively charged electron holes are drawn to the 
left by a uniform magnetic field that points downward. An electric 
field is generated to the right. (b) Negative charged electrons are 
drawn to the left by a magnetic field that points up. An electric field is 
generated to the left. 


Summary 


e The energy structure of a semiconductor can be altered by substituting 
one type of atom with another (doping). 

e Semiconductor n-type doping creates and fills new energy levels just 
below the conduction band. 

e Semiconductor p-type doping creates new energy levels just above the 
valence band. 

e The Hall effect can be used to determine charge, drift velocity, and 
charge carrier number density of a semiconductor. 


Conceptual Questions 


Exercise: 
Problem: 


What kind of semiconductor is produced if germanium is doped with 
(a) arsenic, and (b) gallium? 


Solution: 


a. Germanium has four valence electrons. If germanium doped with 
arsenic (five valence electrons), four are used in bonding and one 
electron will be left for conduction. This produces an n-type material. 
b. If germanium is doped with gallium (three valence electrons), all 
three electrons are used in bonding, leaving one hole for conduction. 
This results in a p-type material. 


Exercise: 
Problem: 
What kind of semiconductor is produced if silicon is doped with (a) 
phosphorus, and (b) indium? 


Exercise: 


Problem: What is the Hall effect and what is it used for? 


Solution: 


The Hall effect is the production of a potential difference due to 
motion of a conductor through an external magnetic field. This effect 
can be used to determine the drift velocity of the charge carriers 
(electrons or hole). If the current density is measured, this effect can 
also determine the number of charge carriers per unit volume. 


Exercise: 
Problem: 
For an n-type semiconductor, how do impurity atoms alter the energy 
structure of the solid? 
Exercise: 
Problem: 


For a p-type semiconductor, how do impurity atoms alter the energy 
structure of the solid? 


Solution: 


It produces new unfilled energy levels just above the filled valence 
band. These levels accept electrons from the valence band. 


Problems 


Exercise: 


Problem: 


An experiment is performed to demonstrate the Hall effect. A thin 
rectangular strip of semiconductor with width 10 cm and length 30 cm 
is attached to a battery and immersed in a 1.50-T field perpendicular to 
its surface. This produced a Hall voltage of 12 V. What is the drift 
velocity of the charge carriers? 


Exercise: 


Problem: 


Suppose that the cross-sectional area of the strip (the area of the face 
perpendicular to the electric current) presented to the in the preceding 
problem is 1 mm? and the current is independently measured to be 2 
mA. What is the number density of the charge carriers? 


Solution: 


n= 1.56 x 10! holes/m?® 
Exercise: 
Problem: 
A current-carrying copper wire with cross-section o = 2mm? has a 


drift velocity of 0.02 cm/s. Find the total current running through the 
wire. 


Exercise: 
Problem: 
The Hall effect is demonstrated in the laboratory. A thin rectangular 
strip of semiconductor with width 5 cm and cross-sectional area 
2 mm’ is attached to a battery and immersed in a field perpendicular to 


its surface. The Hall voltage reads 12.5 V and the measured drift 
velocity is 50 m/s. What is the magnetic field? 


Solution: 


aa fi 


Glossary 


acceptor impurity 
atom substituted for another in a semiconductor that results in a free 
electron 


donor impurity 
atom substituted for another in a semiconductor that results in a free 
electron hole 


doping 
alteration of a semiconductor by the substitution of one type of atom 
with another 


drift velocity 
average velocity of a randomly moving particle 


hole 
unoccupied states in an energy band 


impurity atom 
acceptor or donor impurity atom 


impurity band 
new energy band create by semiconductor doping 


majority carrier 
free electrons (or holes) contributed by impurity atoms 


minority carrier 
free electrons (or holes) produced by thermal excitations across the 


energy gap 


n-type semiconductor 
doped semiconductor that conducts electrons 


p-type semiconductor 
doped semiconductor that conducts holes 


Semiconductor Devices 
By the end of this section, you will be able to: 


e Describe what occurs when n- and p-type materials are joined together 
using the concept of diffusion and drift current (zero applied voltage) 

e Explain the response of a p-n junction to a forward and reverse bias 
voltage 

e Describe the function of a transistor in an electric circuit 

e Use the concept of a p-n junction to explain its applications in audio 
amplifiers and computers 


Semiconductors have many applications in modern electronics. We describe 
some basic semiconductor devices in this section. A great advantage of 
using semiconductors for circuit elements is the fact that many thousands or 
millions of semiconductor devices can be combined on the same tiny piece 
of silicon and connected by conducting paths. The resulting structure is 
called an integrated circuit (ic), and ic chips are the basis of many modern 
devices, from computers and smartphones to the internet and global 
communications networks. 


Diodes 


Perhaps the simplest device that can be created with a semiconductor is a 
diode. A diode is a circuit element that allows electric current to flow in 
only one direction, like a one-way valve (see Model of Conduction in 
Metals). A diode is created by joining a p-type semiconductor to an n-type 
semiconductor ([link]). The junction between these materials is called a p-n 
junction. A comparison of the energy bands of a silicon-based diode is 
shown in [link](b). The positions of the valence and conduction bands are 
the same, but the impurity levels are quite different. When a p-n junction is 
formed, electrons from the conduction band of the n-type material diffuse to 
the p-side, where they combine with holes in the valence band. This 
migration of charge leaves positive ionized donor ions on the n-side and 
negative ionized acceptor ions on the p-side, producing a narrow double 
layer of charge at the p-n junction called the depletion layer. The electric 
field associated with the depletion layer prevents further diffusion. The 
potential energy for electrons across the p-n junction is given by [link]. 


(a) 


Electrons at the 
bottom of the 
e@3e @ conduction band 


Holes at the top 
of the valence band 


(b) 


(a) Representation of a p-n junction. (b) A comparison of the energy 
bands of p-type and n-type silicon prior to equilibrium. 
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(a) 


Potential difference 
prevents diffusion of 
electrons from n side 
to p side. 


(b) 


At equilibrium, (a) excess charge resides near the interface and the net 
current is zero, and (b) the potential energy difference for electrons (in 
light blue) prevents further diffusion of electrons into the p-side. 


The behavior of a semiconductor diode can now be understood. If the 
positive side of the battery is connected to the n-type material, the depletion 
layer is widened, and the potential energy difference across the p-n junction 
is increased. Few or none of the electrons (holes) have enough energy to 
climb the potential barrier, and current is significantly reduced. This is 
called the reverse bias configuration. On the other hand, if the positive 
side of a battery is connected to the p-type material, the depletion layer is 
narrowed, the potential energy difference across the p-n junction is reduced, 
and electrons (holes) flow easily. This is called the forward bias 
configuration of the diode. In sum, the diode allows current to flow freely 
in one direction but prevents current flow in the opposite direction. In this 
sense, the semiconductor diode is a one-way valve. 


We can estimate the mathematical relationship between the current and 
voltage for a diode using the electric potential concept. Consider N 
negatively charged majority carriers (electrons donated by impurity atoms) 
in the n-type material and a potential barrier V across the p-n junction. 
According to the Maxwell-Boltzmann distribution, the fraction of electrons 
that have enough energy to diffuse across the potential barrier is NV ee het 
. However, if a battery of voltage V, is applied in the forward-bias 
configuration, this fraction improves to Ne~¢(Y—-)/sT_ The electric 
current due to the majority carriers from the n-side to the p-side is therefore 
Equation: 


io Ne eV /kst ec ¥o/keT = Tpee¥/ HT 


where Jo is the current with no applied voltage and T is the temperature. 
Current due to the minority carriers (thermal excitation of electrons from 
the valence band to the conduction band on the p-side and subsequent 
attraction to the n-side) is —Jo, independent of the bias voltage. The net 
current is therefore 


Note: 
Equation: 


eer (ex hyD 1). 


A sample graph of the current versus bias voltage is given in [link]. In the 
forward bias configuration, small changes in the bias voltage lead to large 
changes in the current. In the reverse bias configuration, the current is 
Inet  —Ip. For extreme values of reverse bias, the atoms in the material 
are ionized which triggers an avalanche of current. This case occurs at the 
breakdown voltage. 


net 


Breakdown 
voltage 


Reverse bias Forward bias 


Current versus voltage across a p-n junction (diode). In the 
forward bias configuration, electric current flows easily. However, 
in the reverse bias configuration, electric current flow very little. 


Example: 

Diode Current 

Attaching the positive end of a battery to the p-side and the negative end to 
the n-side of a semiconductor diode produces a current of 4.5 x 101A. 
The reverse saturation current is 2.2 x 10 8A. (The reverse saturation 
current is the current of a diode in a reverse bias configuration such as 
this.) The battery voltage is 0.12 V. What is the diode temperature? 
Strategy 

The first arrangement is a forward bias configuration, and the second is the 
reverse bias configuration. In either case, [link] gives the current. 

Solution 

The current in the forward and reverse bias configurations is given by 
Equation: 


| ee (ewes = 1). 
The current with no bias is related to the reverse saturation current by 
Equation: 
ip @ Sls = 22 x LO 


Therefore 
Equation: 


Inet 4.5 x 107° A 


— = 290 x 10°. 
Ip 22x 108A 


[link] can be written as 
Equation: 


Tet Hea ooVo/ kel 


This ratio is much greater than one, so the second term on the left-hand 
side of the equation vanishes. Taking the natural log of both sides gives 
Equation: 


eV, 


= 18. 
kpT 


The temperature is therefore 
Equation: 


_ =(a) _ e (0.12 V) (<5) 73K. 
kp \ 19 8.617 x 10 °eV/K \ 19 
Significance 
The current moving through a diode in the forward and reverse bias 
configuration is sensitive to the temperature of the diode. If the potential 
energy supplied by the battery is large compared to the thermal energy of 


the diode’s surroundings, kp7’, then the forward bias current is very large 
compared to the reverse saturation current. 


Note: 
Exercise: 


Problem: 


Check Your Understanding How does the magnitude of the forward 
bias current compare with the reverse bias current? 


Solution: 


The forward bias current is much larger. To a good approximation, 
diodes permit current flow in only one direction. 


Note: 

Create a p-n junction and observe the behavior of a simple circuit for 
forward and reverse bias voltages. Visit this site to learn more about 
semiconductor diodes. 


Junction Transistor 


If diodes are one-way valves, transistors are one-way valves that can be 
carefully opened and closed to control current. A special kind of transistor 
is a junction transistor. A junction transistor has three parts, including an 
n-type semiconductor, also called the emitter; a thin p-type semiconductor, 
which is the base; and another n-type semiconductor, called the collector 
({link]). When a positive terminal is connected to the p-type layer (the 
base), a small current of electrons, called the base current Ip, flows to the 
terminal. This causes a large collector current [, to flow through the 
collector. The base current can be adjusted to control the large collector 
current. The current gain is therefore 


Note: 
Equation: 


Il = PIp. 


Collector 
(n-type) 


Base 
(p-type) 


Emitter 
(n-type) Large 
electron 
flow 
(collector 


current) 


Small electron flow 
(base current) 


Base voltage 


A junction transistor has three parts: emitter, base, and 
collector. Voltage applied to the base acts as a valve to 
control electric current from the emitter to the collector. 


A junction transistor can be used to amplify the voltage from a microphone 
to drive a loudspeaker. In this application, sound waves cause a diaphragm 
inside the microphone to move in and out rapidly ([link]). When the 
diaphragm is in the “in” position, a tiny positive voltage is applied to the 
base of the transistor. This opens the transistor “valve” and allows a large 
electrical current flow to the loudspeaker. When the diaphragm is in the 
“out” position, a tiny negative voltage is applied to the base of the 
transistor, which shuts off the transistor valve so that no current flows to the 
loudspeaker. This shuts the transistor “valve” off so no current flows to the 
loudspeaker. In this way, current to the speaker is controlled by the sound 
waves, and the sound is amplified. Any electric device that amplifies a 
signal is called an amplifier. 


Loudspeaker 


Collector 
(n-type) 


Base 
(p-type) 


Emitter 
(n-type) 


Base Collector 
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current) 
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electron flow Small electron flow 


(base current) 


Base voltage 


Diaphragm moves out Diaphragm moves in 


~~ 
Microphone 
(7 
An audio amplifier based on a junction transistor. Voltage applied to 


the base by a microphone acts as a valve to control a larger electric 
current that passes through a loudspeaker. 


In modern electronic devices, digital signals are used with diodes and 
transistors to perform tasks such as data manipulation. Electric circuits 
carry two types of electrical signals: analog and digital ((link]). An analog 
signal varies continuously, whereas a digital signal switches between two 
fixed voltage values, such as plus 1 volt and zero volts. In digital circuits 
like those found in computers, a transistor behaves like an on-off switch. 
The transistor is either on, meaning the valve is completely open, or it is 
off, meaning the valve is completely closed. Integrated circuits contain vast 
collections of transistors on a single piece of silicon. They are designed to 
handle digital signals that represent ones and zeroes, which is also known as 
binary code. The invention of the ic helped to launch the modern computer 
revolution. 


Analog signal Digital signal 


Real-world data are often analog, meaning data can vary continuously. 
Intensity values of sound or visual images are usually analog. These 
data are converted into digital signals for electronic processing in 
recording devices or computers. The digital signal is generated from 
the analog signal by requiring certain voltage cut-off value. 


Summary 


e A diode is produced by an n-p junction. A diode allows current to 
move in just one direction. In forward biased configuration of a diode, 
the current increases exponentially with the voltage. 

e A transistor is produced by an n-p-n junction. A transistor is an electric 
valve that controls the current in a circuit. 

e A transistor is a critical component in audio amplifiers, computers, and 
many other devices. 


Conceptual Questions 


Exercise: 


Problem: 


When p- and n-type materials are joined, why is a uniform electric 
field generated near the junction? 


Exercise: 
Problem: 


When p- and n-type materials are joined, why does the depletion layer 
not grow indefinitely? 


Solution: 


The electric field produced by the uncovered ions reduces further 
diffusion. In equilibrium, the diffusion and drift currents cancel so the 
net current is zero. Therefore, the resistance of the depletion region is 
large. 


Exercise: 


Problem: 


How do you know if a diode is in the forward biased configuration? 
Exercise: 


Problem: 


Why does the reverse bias configuration lead to a very small current? 


Solution: 


The positive terminal is applied to the n-side, which uncovers more 
ions near the junction (widens the depletion layer), increases the 
junction voltage difference, and therefore reduces the diffusion of 
holes across the junction. 


Exercise: 
Problem: 
What happens in the extreme case that where the n- and p-type 
materials are heavily doped? 


Exercise: 


Problem: 
Explain how an audio amplifier works, using the transistor concept. 
Solution: 


Sound moves a diaphragm in and out, which varies the input or base 
current of the transistor circuit. The transistor amplifies this signal (p- 
n-p semiconductor). The output or collector current drives a speaker. 


Problems 


Exercise: 


Problem: Show that for V less than zero, Inet © —p. 
Exercise: 
Problem: 
A p-ndiode has a reverse saturation current 1.44 x 108A. Itis 
forward biased so that it has a current of 6.78 x 107! A moving 


through it. What bias voltage is being applied if the temperature is 300 
K? 


Solution: 


V, = 0.458 V 
Exercise: 
Problem: 
The collector current of a transistor is 3.4 A for a base current of 4.2 
mA. What is the current gain? 


Exercise: 


Problem: 


Applying the positive end of a battery to the p-side and the negative 
end to the n-side of a p-n junction, the measured current is 

8.76 x 101A. Reversing this polarity give a reverse saturation 
current of 4.41 x 10-8 A. What is the temperature if the bias voltage 
is 12-2 


Solution: 


T = 829K 
Exercise: 


Problem: 


The base current of a transistor is 4.4 A, and its current gain 1126. 
What is the collector current? 


Glossary 


amplifier 
electrical device that amplifies an electric signal 


base current 
current drawn from the base n-type material in a transistor 


breakdown voltage 
in a diode, the reverse bias voltage needed to cause an avalanche of 
current 


collector current 
current drawn from the collector p-type material 


depletion layer 
region near the p-n junction that produces an electric field 


forward bias configuration 


diode configuration that results in high current 


junction transistor 
electrical valve based on a p-n-p junction 


p-n junction 
junction formed by joining p- and n-type semiconductors 


reverse bias configuration 
diode configuration that results in low current 


Superconductivity 
By the end of this section, you will be able to: 


e Describe the main features of a superconductor 

¢ Describe the BCS theory of superconductivity 

¢ Determine the critical magnetic field for T = 0 K from magnetic field 
data 

¢ Calculate the maximum emf or current for a wire to remain 
superconducting 


Electrical resistance can be considered as a measure of the frictional force 
in electrical current flow. Thus, electrical resistance is a primary source of 
energy dissipation in electrical systems such as electromagnets, electric 
motors, and transmission lines. Copper wire is commonly used in electrical 
wiring because it has one of the lowest room-temperature electrical 
resistivities among common conductors. (Actually, silver has a lower 
resistivity than copper, but the high cost and limited availability of silver 
outweigh its savings in energy over copper.) 


Although our discussion of conductivity seems to imply that all materials 
must have electrical resistance, we know that this is not the case. When the 
temperature decreases below a critical value for many materials, their 
electrical resistivity drops to zero, and the materials become 
superconductors (see Superconductors). 


Note: 
Watch this NOVA video excerpt, Making Stuff Colder, as an introduction to 
the topic of superconductivity and its many applications. 


Properties of Superconductors 


In addition to zero electrical resistance, superconductors also have perfect 
diamagnetism. In other words, in the presence of an applied magnetic field, 
the net magnetic field within a superconductor is always zero ((link]). 


Therefore, any magnetic field lines that pass through a superconducting 
sample when it is in its normal state are expelled once the sample becomes 
superconducting. These are manifestations of the Meissner effect, which 
you learned about in the chapter on current and resistance. 


Normal Superconducting 


(a) (b) 


(a) In the Meissner effect, a magnetic field is expelled from a material 
once it becomes superconducting. (b) A magnet can levitate above a 
superconducting material, supported by the force expelling the 
magnetic field. (credit b: modification of work by Kevin Jarrett) 


Interestingly, the Meissner effect is not a consequence of the resistance 
being zero. To see why, suppose that a sample placed in a magnetic field 
undergoes a transition in which its resistance drops to zero. From Ohm’s 
law, the current density, j, in the sample is related to the net internal electric 
field, E, and the resistivity p by 7 = E’/p. If p is zero, E must also be zero 
so that j can remain finite. Now E and the magnetic flux @,, through the 
sample are related by Faraday’s law as 

Equation: 


f Bal = ara 
dt 


If E is zero, d®,, /dt is also zero, that is, the magnetic flux through the 
sample cannot change. The magnetic field lines within the sample should 
therefore not be expelled when the transition occurs. Hence, it does not 
follow that a material whose resistance goes to zero has to exhibit the 
Meissner effect. Rather, the Meissner effect is a special property of 
superconductors. 


Another important property of a superconducting material is its critical 
temperature, J’, the temperature below which the material is 
superconducting. The known range of critical temperatures is from a 
fraction of 1 K to slightly above 100 K. Superconductors with critical 
temperatures near this higher limit are commonly known as “high- 
temperature” superconductors. From a practical standpoint, 
superconductors for which T,, >> 77 K are very important. At present, 
applications involving superconductors often still require that 
superconducting materials be immersed in liquid helium (4.2 K) in order to 
keep them below their critical temperature. The liquid helium baths must be 
continually replenished because of evaporation, and cooling costs can easily 
outweigh the savings in using a superconductor. However, 77 K is the 
temperature of liquid nitrogen, which is far more abundant and inexpensive 
than liquid helium. It would be much more cost-effective if we could easily 
fabricate and use high-temperature superconductor components that only 
need to be kept in liquid nitrogen baths to maintain their superconductivity. 


High-temperature superconducting materials are presently in use in various 
applications. An example is the production of magnetic fields in some 
particle accelerators. The ultimate goal is to discover materials that are 
superconducting at room temperature. Without any cooling requirements, 
the bulk of electronic components and transmission lines could be 
superconducting, resulting in dramatic and unprecedented increases in 
efficiency and performance. 


Another important property of a superconducting material is its critical 
magnetic field B.(T), which is the maximum applied magnetic field at a 


temperature T that will allow a material to remain superconducting. An 
applied field that is greater than the critical field will destroy the 
superconductivity. The critical field is zero at the critical temperature and 
increases as the temperature decreases. Plots of the critical field versus 
temperature for several superconducting materials are shown in [link]. The 
temperature dependence of the critical field can be described approximately 


by 


Note: 
Equation: 


where B,(0) is the critical field at absolute zero temperature. [link] lists the 
critical temperatures and fields for two classes of superconductors: type I 
superconductor and type II superconductor. In general, type I 
superconductors are elements, such as aluminum and mercury. They are 
perfectly diamagnetic below a critical field B-(T), and enter the normal 
non-superconducting state once that field is exceeded. The critical fields of 
type I superconductors are generally quite low (well below one tesla). For 
this reason, they cannot be used in applications requiring the production of 
high magnetic fields, which would destroy their superconducting state. 


B,, (tesla) 


0.08 


0.06 


0.04 


0.02 


T (Kelvin) 


The temperature dependence of the critical 
field for several superconductors. 
Superconductivity occurs for magnetic 
fields and temperatures below the curves 
shown. 


Critical Critical Magnetic 
Material Temperature (K) Field (T) 


Material 
Type I 
Al 

Ga 

Hg (a) 
In 

Nb 

Pb 

Sn 

Th 

Zn 
Type II 
Nb3Al 
Nb3Ge 


Nb3Sn 


NbTi 


YBayCu307 


Critical Temperature and Critical Magnetic Field at 7’ = 0 K for Various 


Superconductors 


Critical 
Temperature (K) 


1.2 
1.1 
4.2 
3.4 
9.3 
7.2 
3.7 
1.4 


0.87 


Critical Magnetic 
Field (T) 


0.080 
0.031 
0.00016 


0.0053 


32 
38 
25 
15 


>100 


Type II superconductors are generally compounds or alloys involving 
transition metals or actinide series elements. Almost all superconductors 
with relatively high critical temperatures are type II. They have two critical 
fields, represented by B.i(T’) and B.2(T). When the field is below 
B.(T), type II superconductors are perfectly diamagnetic, and no 
magnetic flux penetration into the material can occur. For a field exceeding 
B.2(T), they are driven into their normal state. When the field is greater 
than B.1(T) but less than B.2(T), type II superconductors are said to be in 
a mixed state. Although there is some magnetic flux penetration in the 
mixed state, the resistance of the material is zero. Within the 
superconductor, filament-like regions exist that have normal electrical and 
magnetic properties interspersed between regions that are superconducting 
with perfect diamagnetism. A representation of this state is given in [link]. 
The magnetic field is expelled from the superconducting regions but exists 
in the normal regions. In general, B.2(T’) is very large compared with the 
critical fields of type I superconductors, so wire made of type II 
superconducting material is suitable for the windings of high-field magnets. 


Superconducting 


A schematic representation of the mixed state 
of a type IT superconductor. Superconductors 
(the gray squares) expel magnetic fields in their 
vicinity. 


Example: 
Niobium Wire 


In an experiment, a niobium (Nb) wire of radius 0.25 mm is immersed in 
liquid helium (7’ = 4.2 K) and required to carry a current of 300 A. Does 
the wire remain superconducting? 

Strategy 

The applied magnetic field can be determined from the radius of the wire 
and current. The critical magnetic field can be determined from [link], the 
properties of the superconductor, and the temperature. If the applied 
magnetic field is greater than the critical field, then superconductivity in 
the Nb wire is destroyed. 

Solution 

At T = 4.2 K, the critical field for Nb is, from [link] and [link], 


Equation: 
,_ (42K : 
9.3K 


In an earlier chapter, we learned the magnetic field inside a current- 
carrying wire of radius a is given by 
Equation: 


B.(4.2 K) = B,(0) = (0.20 T)(0.80) = 0.16 T. 


where r is the distance from the central axis of the wire. Thus, the field at 


the surface of the wire is — . For the niobium wire, this field is 
Equation: 


4 10-’ T m/A)(300 A 
pa Mm x WT m/A)(B00A) _ yon 
2n(2.5 x 10-4 m) 


Since this exceeds the critical 0.16 T, the wire does not remain 
superconducting. 

Significance 

Superconductivity requires low temperatures and low magnetic fields. 
These simultaneous conditions are met less easily for Nb than for many 


other metals. For example, aluminum superconducts at temperatures 7 
times lower and magnetic fields 18 times lower. 


Note: 
Exercise: 


Problem: 


Check Your Understanding What conditions are necessary for 
superconductivity? 


Solution: 


a low temperature and low magnetic field 


Theory of Superconductors 


A successful theory of superconductivity was developed in the 1950s by 
John Bardeen, Leon Cooper, and J. Robert Schrieffer, for which they 
received the Nobel Prize in 1972. This theory is known as the BCS theory. 
BCS theory is complex, so we summarize it qualitatively below. 


In a normal conductor, the electrical properties of the material are due to the 
most energetic electrons near the Fermi energy. In 1956, Cooper showed 
that if there is any attractive interaction between two electrons at the Fermi 
level, then the electrons can form a bound state in which their total energy 
is less than 2p. Two such electrons are known as a Cooper pair. 


It is hard to imagine two electrons attracting each other, since they have like 
charge and should repel. However, the proposed interaction occurs only in 
the context of an atomic lattice. A depiction of the attraction is shown in 
[link]. Electron 1 slightly displaces the positively charged atomic nuclei 
toward itself as it travels past because of the Coulomb attraction. Electron 2 
“sees” a region with a higher density of positive charge relative to the 


surroundings and is therefore attracted into this region and, therefore 
indirectly, to electron 1. Because of the exclusion principle, the two 
electrons of a Cooper pair must have opposite spin. 


A Cooper pair can form as a result of the displacement 
of positive atomic nuclei. Electron 1 slightly displaces 
the positively charged atomic nuclei toward itself as it 
travels past because of the Coulomb attraction. Electron 
2 “sees” a region with a higher density of positive 
charge relative to the surroundings and is therefore 
attracted into this region. 


The BCS theory extends Cooper’s ideas, which are for a single pair of 
electrons, to the entire free electron gas. When the transition to the 
superconducting state occurs, all the electrons pair up to form Cooper pairs. 
On an atomic scale, the distance between the two electrons making up a 
Cooper pair is quite large. Between these electrons are typically about 10° 
other electrons, each also pairs with a distant electron. Hence, there is 
considerable overlap between the wave functions of the individual Cooper 
pairs, resulting in a strong correlation among the motions of the pairs. They 
all move together “in step,” like the members of a marching band. In the 
superconducting transition, the density of states becomes drastically 
changed near the Fermi level. As shown in [link], an energy gap appears 
around Fr because the collection of Cooper pairs has lower ground state 
energy than the Fermi gas of noninteracting electrons. The appearance of 
this gap characterizes the superconducting state. If this state is destroyed, 
then the gap disappears, and the density of states reverts to that of the free 
electron gas. 


g(E) 


A relatively large energy gap is formed around the 
Fermi energy when a material becomes 
superconducting. If this state is destroyed, then the 


gap disappears, and the density of states reverts to 
that of the free electron gas. 


The BCS theory is able to predict many of the properties observed in 
superconductors. Examples include the Meissner effect, the critical 
temperature, the critical field, and, perhaps most importantly, the resistivity 
becoming zero at a critical temperature. We can think about this last 
phenomenon qualitatively as follows. In a normal conductor, resistivity 
results from the interaction of the conduction electrons with the lattice. In 
this interaction, the energy exchanged is on the order of kp7’, the thermal 
energy. In a superconductor, electric current is carried by the Cooper pairs. 
The only way for a lattice to scatter a Cooper pair is to break it up. The 
destruction of one pair then destroys the collective motion of all the pairs. 
This destruction requires energy on the order of 10 eV, which is the size 
of the energy gap. Below the critical temperature, there is not enough 
thermal energy available for this process, so the Cooper pairs travel 
unimpeded throughout the superconductor. 


Finally, it is interesting to note that no evidence of superconductivity has 
been found in the best normal conductors, such as copper and silver. This is 
not unexpected, given the BCS theory. The basis for the formation of the 
superconducting state is an interaction between the electrons and the lattice. 
In the best conductors, the electron-lattice interaction is weakest, as evident 
from their minimal resistivity. We might expect then that in these materials, 
the interaction is so weak that Cooper pairs cannot be formed, and 
superconductivity is therefore precluded. 


Summary 


e A superconductor is characterized by two features: the conduction of 
electrons with zero electrical resistance and the repelling of magnetic 
field lines. 

e A minimum temperature is required for superconductivity to occur. 

e A strong magnetic field destroys superconductivity. 

e Superconductivity can be explain in terms of Cooper pairs. 
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Conceptual Questions 


Exercise: 


Problem: Describe two main features of a superconductor. 


Exercise: 


Problem: How does BCS theory explain superconductivity? 


Solution: 


BSC theory explains superconductivity in terms of the interactions 
between electron pairs (Cooper pairs). One electron in a pair interacts 
with the lattice, which interacts with the second electron. The combine 
electron-lattice-electron interaction binds the electron pair together in a 
way that overcomes their mutual repulsion. 


Exercise: 


Problem: What is the Meissner effect? 
Exercise: 


Problem: 


What impact does an increasing magnetic field have on the critical 
temperature of a semiconductor? 


Solution: 
As the magnitude of the magnetic field is increased, the critical 
temperature decreases. 

Problems 


Exercise: 


Problem: 


At what temperature, in terms of TJ‘, is the critical field of a 
superconductor one-half its value at 7’ = 0 K ? 


Solution: 


LP =010T F 


Exercise: 


Problem: What is the critical magnetic field for lead at T = 2.8 K ? 


Exercise: 


Problem: 


A Pb wire wound in a tight solenoid of diameter of 4.0 mm is cooled 
to a temperature of 5.0 K. The wire is connected in series with a 50-2 
resistor and a variable source of emf. As the emf is increased, what 
value does it have when the superconductivity of the wire is 
destroyed? 


Solution: 


61 kV 
Exercise: 
Problem: 
A tightly wound solenoid at 4.0 K is 50 cm long and is constructed 


from Nb wire of radius 1.5 mm. What maximum current can the 
solenoid carry if the wire is to remain superconducting? 


Additional Problems 


Exercise: 


Problem: 


Potassium fluoride (KF) is a molecule formed by an ionic bond. At 
equilibrium separation the atoms are 79 = 0.255 nm apart. Determine 
the electrostatic potential energy of the atoms. The electron affinity of 
F is 3.40 eV and the ionization energy of K is 4.34 eV. Determine 
dissociation energy. (Neglect the energy of repulsion.) 


Solution: 
Ueco = —5.65eV 
Etom = —4.7leV, Eaiss = 4.71 eV 


Exercise: 


Problem: 


For the preceding problem, sketch the potential energy versus 
separation graph for the bonding of K* and FI ions. (a) Label the 
graph with the energy required to transfer an electron from K to FI. (b) 
Label the graph with the dissociation energy. 


Exercise: 
Problem: 


The separation between hydrogen atoms in a Hz molecule is about 
0.075 nm. Determine the characteristic energy of rotation in eV. 


Solution: 


Eor = 7.43 x 10% eV 
Exercise: 
Problem: 


The characteristic energy of the Cl, molecule is 2.95 x 10~° eV. 
Determine the separation distance between the nitrogen atoms. 


Exercise: 
Problem: Determine the lowest three rotational energy levels of Hy». 


Solution: 


Er = 7.43 x 107% eV; 1 = 0; LE, = 0 eV (no rotation); 

l=1;E, = 1.49 x 10-7eV:1 =2; FB, = 4.46 x 10-7 eV 
Exercise: 

Problem: 


A carbon atom can hybridize in the sp? configuration. (a) What is the 
angle between the hybrid orbitals? 


Exercise: 


Problem: 


List five main characteristics of ionic crystals that result from their 
high dissociation energy. 


Solution: 


ee 


. They are fairly hard and stable. 

ii. They vaporize at relatively high temperatures (1000 to 2000 K). 

iii. They are transparent to visible radiation, because photons in the 
visible portion of the spectrum are not energetic enough to excite 
an electron from its ground state to an excited state. 

iv. They are poor electrical conductors because they contain 
effectively no free electrons. 

v. They are usually soluble in water, because the water molecule has 

a large dipole moment whose electric field is strong enough to 

break the electrostatic bonds between the ions. 


f—e mo 


Exercise: 
Problem: 
Why is bonding in H2* favorable? Express your answer in terms of 
the symmetry of the electron wave function. 
Exercise: 
Problem: 


Astronomers claim to find evidence of He» from light spectra of a 
distant star. Do you believe them? 


Solution: 


No, He atoms do not contain valence electrons that can be shared in 
the formation of a chemical bond. 


Exercise: 


Problem: 


Show that the moment of inertia of a diatomic molecule is I = pr?, 


where p is the reduced mass, and 79 is the distance between the 
masses. 


Exercise: 


Problem: 
Show that the average energy of an electron in a one-dimensional 


metal is related to the Fermi energy by E= + Ep. 


Solution: 


ne 2_ 1/N\3 = 1 
_ mw =a(y) 90k = sEr 


Exercise: 
Problem: 
Measurements of a superconductor’s critical magnetic field (in T) at 


various temperatures (in K) are given below. Use a line of best fit to 
determine B.(0). Assume T, = 9.3 K. 


T (in K) B.(T) 
3.0 0.18 
4.0 0.16 


5.0 0.14 


T (in K) B.(T) 


6.0 0.12 
7.0 0.09 
8.0 0.05 
9.0 0.01 
Exercise: 
Problem: 


Estimate the fraction of Si atoms that must be replaced by As atoms in 
order to form an impurity band. 


Solution: 


An impurity band will be formed when the density of the donor atoms 
is high enough that the orbits of the extra electrons overlap. We saw 
earlier that the orbital radius is about 50 Angstroms, so the maximum 
distance between the impurities for a band to form is 100 Angstroms. 
Thus if we use 1 Angstrom as the interatomic distance between the Si 
atoms, we find that 1 out of 100 atoms along a linear chain must be a 
donor atom. And in a three-dimensional crystal, roughly 1 out of 10° 
atoms must be replaced by a donor atom in order for an impurity band 
to form. 


Exercise: 


Problem: 


Transition in the rotation spectrum are observed at ordinary room 
temperature (7’ = 300 K). According to your lab partner, a peak in the 
spectrum corresponds to a transition from the / = 4 to the / = 1 state. 
Is this possible? If so, determine the momentum of inertia of the 
molecule. 


Exercise: 


Problem: 

Determine the Fermi energies for (a) Mg, (b) Na, and (c) Zn. 
Solution: 

a. br =7.11eV;b. Bp = 3.24eV; c. Ep = 9.46 eV 


Exercise: 


Problem: Find the average energy of an electron in a Zn wire. 
Exercise: 


Problem: 


What value of the repulsion constant, n, gives the measured 
dissociation energy of 158 kcal/mol for CsCl? 


Solution: 


9.1529 
Exercise: 


Problem: 
A physical model of a diamond suggests a BCC packing structure. 
Why is this not possible? 

Challenge Problems 


Exercise: 


Problem: 


For an electron in a three-dimensional metal, show that the average 
Er 


energy is given by E= — i Eg(E)dE = 2 Ep, 
0 


Where N is the total number electrons in the metal. 


Solution: 


In three dimensions, the energy of an electron is given by: 

E = R’ Ej, where R? = n? + n3 + n3. Each allowed energy state 
corresponds to node in N space (nj, 2,3). The number of particles 
corresponds to the number of states (nodes) in the first octant, within a 
sphere of radius, R. This number is given by: N = 2 (+) ($)7R?, 
where the factor 2 accounts for two states of spin. The density of states 
is found by differentiating this expression by energy: 


3/2 = 
g(F) = a (53) E/?, Integrating gives: E = a Ep. 


Glossary 


BCS theory 


theory of superconductivity based on electron-lattice-electron 
interactions 


Cooper pair 


coupled electron pair in a superconductor 


critical magnetic field 
maximum field required to produce superconductivity 


critical temperature 


maximum temperature to produce superconductivity 


type I superconductor 
superconducting element, such as aluminum or mercury 


type II superconductor 
superconducting compound or alloy, such as a transition metal or an 
actinide series element 


Introduction 
class="introduction" 


The Sun is 
powered by 
nuclear fusion 
in its core. The 
core converts 
approximately 
1938 
protons/secon 
d into helium 
ata 
temperature of 
14 million K. 
This process 
releases 
energy in the 
form of 
photons, 
neutrinos, and 
other particles. 
(credit: 
modification 
of work by 
EIT SOHO 
Consortium, 
ESA, NASA) 


In this chapter, we study the composition and properties of the atomic 
nucleus. The nucleus lies at the center of an atom, and consists of protons 
and neutrons. A deep understanding of the nucleus leads to numerous 
valuable technologies, including devices to date ancient rocks, map the 
galactic arms of the Milky Way, and generate electrical power. 


The Sun is the main source of energy in the solar system. The Sun is 109 
Earth diameters across, and accounts for more than 99% of the total mass of 
the solar system. The Sun shines by fusing hydrogen nuclei—protons— 
deep inside its interior. Once this fuel is spent, the Sun will burn helium 
and, later, other nuclei. Nuclear fusion in the Sun is discussed toward the 
end of this chapter. In the meantime, we will investigate nuclear properties 
that govern all nuclear processes, including fusion. 


Properties of Nuclei 
By the end of this section, you will be able to: 


¢ Describe the composition and size of an atomic nucleus 

¢ Use a nuclear symbol to express the composition of an atomic nucleus 

e Explain why the number of neutrons is greater than protons in heavy nuclei 
¢ Calculate the atomic mass of an element given its isotopes 


The atomic nucleus is composed of protons and neutrons ([link]). Protons and 
neutrons have approximately the same mass, but protons carry one unit of positive 
charge (+e), and neutrons carry no charge. These particles are packed together into an 
extremely small space at the center of an atom. According to scattering experiments, 
the nucleus is spherical or ellipsoidal in shape, and about 1/100,000th the size of a 
hydrogen atom. If an atom were the size of a major league baseball stadium, the 
nucleus would be roughly the size of the baseball. Protons and neutrons within the 
nucleus are called nucleons. 

Neutron 


The atomic nucleus is composed of protons 
and neutrons. Protons are shown in blue, 
and neutrons are shown in red. 


Counts of Nucleons 


The number of protons in the nucleus is given by the atomic number, Z. The number 
of neutrons in the nucleus is the neutron number, N. The total number of nucleons is 
the mass number, A. These numbers are related by 


Note: 
Equation: 


INS 74 oe INE 


A nucleus is represented symbolically by 


Note: 
Equation: 


where X represents the chemical element, A is the mass number, and Z is the atomic 
number. For example, !2C represents the carbon nucleus with six protons and six 


neutrons (or 12 nucleons). 


A graph of the number N of neutrons versus the number Z of protons for a range of 
stable nuclei (nuclides) is shown in [link]. For a given value of Z, multiple values of N 
(blue points) are possible. For small values of Z, the number of neutrons equals the 
number of protons (NV = P), and the data fall on the red line. For large values of Z, 
the number of neutrons is greater than the number of protons (IV > P), and the data 
points fall above the red line. The number of neutrons is generally greater than the 
number of protons for Z > 15. 
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This graph plots the number of neutrons N against the 
number of protons Z for stable atomic nuclei. Larger 
nuclei, have more neutrons than protons. 


A chart based on this graph that provides more detailed information about each 
nucleus is given in [link]. This chart is called a chart of the nuclides. Each cell or tile 
represents a separate nucleus. The nuclei are arranged in order of ascending Z (along 
the horizontal direction) and ascending N (along the vertical direction). 
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Partial chart of the nuclides. For stable nuclei (dark blue backgrounds), cell 
values represent the percentage of nuclei found on Earth with the same atomic 
number (percent abundance). For the unstable nuclei, the number represents the 
half-life. 


Atoms that contain nuclei with the same number of protons (Z) and different numbers 
of neutrons (N) are called isotopes. For example, hydrogen has three isotopes: normal 
hydrogen (1 proton, no neutrons), deuterium (one proton and one neutron), and tritium 
(one proton and two neutrons). Isotopes of a given atom share the same chemical 
properties, since these properties are determined by interactions between the outer 
electrons of the atom, and not the nucleons. For example, water that contains 
deuterium rather than hydrogen (“heavy water’) looks and tastes like normal water. 
The following table shows a list of common isotopes. 


Element Symbol 
H 


2 
Hydrogen One 


Carbon 


Nitrogen 


Oxygen 180 


199 


Mass 
Number 


1 


2 


ike 


Mass 
(Atomic 
Mass 
Units) 
1.0078 


2.0141 
3.0160 


12.0000 


13.0034 
14.0032 


14.0031 


15.0001 
16.0061 


15.9949 
16.9991 


17.9992 


19.0035 


Percent 
Abundance* 


99.99 


98.91 


Common Isotopes*No entry if less than 0.001 (trace amount). 
**Stable if half-life > 10 seconds. 


Half- 
life** 


stable 
stable 


12.32 
hd 


stable 
stable 


9730 
y 


stable 
stable 


TA0 
S 


stable 
stable 
stable 


26.46 
S 


Why do neutrons outnumber protons in heavier nuclei ({link])? The answer to this 
question requires an understanding of forces inside the nucleus. Two types of forces 
exist: (1) the long-range electrostatic (Coulomb) force that makes the positively 
charged protons repel one another; and (2) the short-range strong nuclear force that 


makes all nucleons in the nucleus attract one another. You may also have heard of a 
“weak” nuclear force. This force is responsible for some nuclear decays, but as the 
name implies, it does not play a role in stabilizing the nucleus against the strong 
Coulomb repulsion it experiences. We discuss strong nuclear force in more detail in 
the next chapter when we cover particle physics. Nuclear stability occurs when the 
attractive forces between nucleons compensate for the repulsive, long-range 
electrostatic forces between all protons in the nucleus. For heavy nuclei (Z > 15), 
excess neutrons are necessary to keep the electrostatic interactions from breaking the 
nucleus apart, as shown in [link]. 


Repulsive force between Attractive forces between 
distant protons adjacent nucleons 


(a) (b) 


(a) The electrostatic force is repulsive and has long 
range. The arrows represent outward forces on protons 
(in blue) at the nuclear surface by a proton (also in 
blue) at the center. (b) The strong nuclear force acts 
between neighboring nucleons. The arrows represent 
attractive forces exerted by a neutron (in red) on its 
nearest neighbors. 


Because of the existence of stable isotopes, we must take special care when quoting 
the mass of an element. For example, Copper (Cu) has two stable isotopes: 
Equation: 


63Cu (62.929595 g/mol) with an abundance of 69.09% 


Equation: 


$oCu (64.927786 g/mol) with an abundance of 30.91% 


Given these two “versions” of Cu, what is the mass of this element? The atomic mass 
of an element is defined as the weighted average of the masses of its isotopes. Thus, 
the atomic mass of Cu is 

Mcu = (62.929595) (0.6909) + (64.927786) (0.3091) = 63.55 g/mol. The mass 
of an individual nucleus is often expressed in atomic mass units (u), where 

u = 1.66054 x 10-?’kg. (An atomic mass unit is defined as 1/12th the mass of a 
12C nucleus.) In atomic mass units, the mass of a helium nucleus (A = 4) is 
approximately 4 u. A helium nucleus is also called an alpha (a) particle. 


Nuclear Size 


The simplest model of the nucleus is a densely packed sphere of nucleons. The volume 
V of the nucleus is therefore proportional to the number of nucleons A, expressed by 
Equation: 


where r is the radius of a nucleus and k is a constant with units of volume. Solving for 
r, we have 


Note: 
Equation: 


r=rAls 


where ro is a constant. For hydrogen (A = 1), ro corresponds to the radius of a single 
proton. Scattering experiments support this general relationship for a wide range of 
nuclei, and they imply that neutrons have approximately the same radius as protons. 
The experimentally measured value for rg is approximately 1.2 femtometer (recall that 
1fm = 10-'m). 


Example: 

The Iron Nucleus 

Find the radius (r) and approximate density () of a Fe-56 nucleus. Assume the mass 
of the Fe-56 nucleus is approximately 56 u. 

Strategy 

(a) Finding the radius of 5°Fe is a straightforward application of r = r9.A‘/°, given 
A = 56. (b) To find the approximate density of this nucleus, assume the nucleus is 
spherical. Calculate its volume using the radius found in part (a), and then find its 
density from p = m/V. 

Solution 


a. The radius of a nucleus is given by 
Equation: 


i roAl A 
Substituting the values for r9 and A yields 
Equation: 


r = (1.2fm)(56)"/? = (1.2 fm) (3.83) 


= AO a1d: 
b. Density is defined to be po = m/V, which for a sphere of radius r is 
Equation: 
m_ m 
fer (4/3)nr3 
Substituting known values gives 
Equation: 
56 
= SSS = 0.138 u/fm*. 
(1.33) (3.14) (4.6 fm) 
Converting to units of kg/m®, we find 
Equation: 
1fm 


p = (0.138 u/fm*)(1.66 x 1072” ke/u) ( ) = 2.3 x 10!’ kg/m’. 


10°! m 


Significance 


a. The radius of the Fe-56 nucleus is found to be approximately 5 fm, so its 
diameter is about 10 fm, or 10-‘*m. In previous discussions of Rutherford’s 
scattering experiments, a light nucleus was estimated to be 10-!°m in diameter. 
Therefore, the result shown for a mid-sized nucleus is reasonable. 

b. The density found here may seem incredible. However, it is consistent with 
earlier comments about the nucleus containing nearly all of the mass of the atom 
in a tiny region of space. One cubic meter of nuclear matter has the same mass as 
a cube of water 61 km on each side. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Nucleus X is two times larger than nucleus Y. 
What is the ratio of their atomic masses? 


Solution: 


eight 


Summary 


e The atomic nucleus is composed of protons and neutrons. 

e The number of protons in the nucleus is given by the atomic number, Z. The 
number of neutrons in the nucleus is the neutron number, N. The number of 
nucleons is mass number, A. 

e Atomic nuclei with the same atomic number, Z, but different neutron numbers, N, 
are isotopes of the same element. 

e The atomic mass of an element is the weighted average of the masses of its 
isotopes. 


Conceptual Questions 


Exercise: 


Problem: 


Define and make clear distinctions between the terms neutron, nucleon, nucleus, 
and nuclide. 


Solution: 


The nucleus of an atom is made of one or more nucleons. A nucleon refers to 
either a proton or neutron. A nuclide is a stable nucleus. 


Exercise: 


Problem: 
What are isotopes? Why do isotopes of the same atom share the same chemical 
properties? 

Problems 


Exercise: 


Problem: 


Find the atomic numbers, mass numbers, and neutron numbers for (a) os Cu, (b) 
2tNa, (c) 734Po, (d) 53Ca, and (e) 788Pb. 


Solution: 


Use the rule A = 7+ N. 


Atomic Number Neutron Number Mass Number 
(Z) (N) (A) 

(a) 29 29 58 

(b) 11 13 24 


(c) 84 126 210 


Atomic Number 


Neutron Number 


Mass Number 


(Z) (N) (A) 
(d) 20 25 45 
(e) 82 124 206 
Exercise: 
Problem: 


Silver has two stable isotopes. The nucleus, 10 Ag, has atomic mass 106.905095 
g/mol with an abundance of 51.83%; whereas '{? Ag has atomic mass 
108.904754 g/mol with an abundance of 48.17%. Find the atomic mass of the 
element silver. 

Exercise: 


Problem: 


The mass (M) and the radius (r) of a nucleus can be expressed in terms of the 
mass number, A. (a) Show that the density of a nucleus is independent of A. (b) 
Calculate the density of a gold (Au) nucleus. Compare your answer to that for 
iron (Fe). 


Solution: 

ar=rAl? p= rae : 

bp = 2.3 x 10 kg/m° 
Exercise: 

Problem: 


A particle has a mass equal to 10 u. If this mass is converted completely into 
energy, how much energy is released? Express your answer in mega-electron 
volts (MeV). (Recall that 1eV = 1.6 x 10°19J.) 


Exercise: 


Problem: 


Find the length of a side of a cube having a mass of 1.0 kg and the density of 
nuclear matter. 


Solution: 


side length = 1.6 pm 
Exercise: 
Problem: 
The detail that you can observe using a probe is limited by its wavelength. 


Calculate the energy of a particle that has a wavelength of 1 x 10~‘®m, small 
enough to detect details about one-tenth the size of a nucleon. 


Glossary 


atomic mass 
total mass of the protons, neutrons, and electrons in a single atom 


atomic mass unit 
unit used to express the mass of an individual nucleus, where 
lu = 1.66054 x 10°?" kg 


atomic nucleus 
tightly packed group of nucleons at the center of an atom 


atomic number 
number of protons in a nucleus 


chart of the nuclides 
graph comprising stable and unstable nuclei 


isotopes 
nuclei having the same number of protons but different numbers of neutrons 


mass number 
number of nucleons in a nucleus 


neutron number 
number of neutrons in a nucleus 


nucleons 
protons and neutrons found inside the nucleus of an atom 


nuclide 


nucleus 


radius of a nucleus 
radius of a nucleus is defined as r = rp A!/® 


strong nuclear force 
force that binds nucleons together in the nucleus 


Nuclear Binding Energy 
By the end of this section, you will be able to: 


e Calculate the mass defect and binding energy for a wide range of 
nuclei 

e Use a graph of binding energy per nucleon (BEN) versus mass number 
(A) graph to assess the relative stability of a nucleus 

e Compare the binding energy of a nucleon in a nucleus to the ionization 
energy of an electron in an atom 


The forces that bind nucleons together in an atomic nucleus are much 
greater than those that bind an electron to an atom through electrostatic 
attraction. This is evident by the relative sizes of the atomic nucleus and the 
atom (10> and 101° m, respectively). The energy required to pry a 
nucleon from the nucleus is therefore much larger than that required to 
remove (or ionize) an electron in an atom. In general, all nuclear changes 
involve large amounts of energy per particle undergoing the reaction. This 
has numerous practical applications. 


Mass Defect 


According to nuclear particle experiments, the total mass of a nucleus 
(Mnuc) is less than the sum of the masses of its constituent nucleons 
(protons and neutrons). The mass difference, or mass defect, is given by 


Note: 
Equation: 


Am = 2m, +(A—Z)m, — Mane 


where Zm, is the total mass of the protons, (A — Z)mzy is the total mass 
of the neutrons, and m,,,. is the mass of the nucleus. According to 


Einstein’s special theory of relativity, mass is a measure of the total energy 
of asystem (EF = mc?). Thus, the total energy of a nucleus is less than the 
sum of the energies of its constituent nucleons. The formation of a nucleus 
from a system of isolated protons and neutrons is therefore an exothermic 
reaction—meaning that it releases energy. The energy emitted, or radiated, 
in this process is (Am)c?. 


Now imagine this process occurs in reverse. Instead of forming a nucleus, 
energy is put into the system to break apart the nucleus ({link]). The amount 
of energy required is called the total binding energy (BE), Fy. 


Note: 

Binding Energy 

The binding energy is equal to the amount of energy released in forming 
the nucleus, and is therefore given by 

Equation: 


E, = (Am)c? . 


Experimental results indicate that the binding energy for a nucleus with 
mass number A > 8 is roughly proportional to the total number of nucleons 
in the nucleus, A. The binding energy of a magnesium nucleus (74Mg), for 
example, is approximately two times greater than for the carbon nucleus ( 


12 C). 


+ Binding energy 3 ‘*) *) 0 


Nucleus Separated nucleons 
(smaller mass) (greater mass) 


The binding energy is the energy required to break a nucleus into its 
constituent protons and neutrons. A system of separated nucleons has a 
greater mass than a system of bound nucleons. 


Example: 

Mass Defect and Binding Energy of the Deuteron 

Calculate the mass defect and the binding energy of the deuteron. The mass 
of the deuteron is mp = 3.34359 x 10~?"kg or 1875.61 MeV/c’. 
Solution 

From [link], the mass defect for the deuteron is 

Equation: 


Am =m,+™m, —™p 
= 938.28 MeV/c? + 939.57 MeV/c? — 1875.61 MeV/c? 
= 2.24 MeV/c’. 


The binding energy of the deuteron is then 
Equation: 


Ey = (Am)c? = (2.24 MeV/c’) (c?) = 2.24 MeV. 


Over two million electron volts are needed to break apart a deuteron into a 
proton and a neutron. This very large value indicates the great strength of 
the nuclear force. By comparison, the greatest amount of energy required 
to liberate an electron bound to a hydrogen atom by an attractive Coulomb 
force (an electromagnetic force) is about 10 eV. 


Graph of Binding Energy per Nucleon 


In nuclear physics, one of the most important experimental quantities is the 
binding energy per nucleon (BEN), which is defined by 


Note: 
Equation: 


BEN = — 


This quantity is the average energy required to remove an individual 
nucleon from a nucleus—analogous to the ionization energy of an electron 
in an atom. If the BEN is relatively large, the nucleus is relatively stable. 
BEN values are estimated from nuclear scattering experiments. 


A graph of binding energy per nucleon versus atomic number A is given in 
[link]. This graph is considered by many physicists to be one of the most 
important graphs in physics. Two notes are in order. First, typical BEN 
values range from 6-10 MeV, with an average value of about 8 MeV. In 
other words, it takes several million electron volts to pry a nucleon from a 
typical nucleus, as compared to just 13.6 eV to ionize an electron in the 
ground state of hydrogen. This is why nuclear force is referred to as the 
“strong” nuclear force. 


Second, the graph rises at low A, peaks very near iron (Fe, A = 56), and 
then tapers off at high A. The peak value suggests that the iron nucleus is 
the most stable nucleus in nature (it is also why nuclear fusion in the cores 
of stars ends with Fe). The reason the graph rises and tapers off has to do 
with competing forces in the nucleus. At low values of A, attractive nuclear 
forces between nucleons dominate over repulsive electrostatic forces 
between protons. But at high values of A, repulsive electrostatic forces 
between forces begin to dominate, and these forces tend to break apart the 
nucleus rather than hold it together. 
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In this graph of binding energy per nucleon for stable nuclei, the BEN 
is greatest for nuclei with a mass near °°Fe. Therefore, fusion of nuclei 
with mass numbers much less than that of Fe, and fission of nuclei 
with mass numbers greater than that of Fe, are exothermic processes. 


As we will see, the BEN-versus-A graph implies that nuclei divided or 
combined release an enormous amount of energy. This is the basis for a 
wide range of phenomena, from the production of electricity at a nuclear 
power plant to sunlight. 


Example: 

Tightly Bound Alpha Nuclides 

Calculate the binding energy per nucleon of an *He (a particle). 
Strategy 

Determine the total binding energy (BE) using the equation 

BE = (Am)c?, where Am is the mass defect. The binding energy per 
nucleon (BEN) is BE divided by A. 

Solution 

For *He, we have Z = N = 2. The total binding energy is 

Equation: 


BE = {[2m, + 2m,] — m (*He) }c’. 


These masses are m (*He) = 4.002602 u, m, = 1.007825 u, and 
Mn = 1.008665 u. Thus we have, 
Equation: 


BE = (0.030378 u)c?. 


Noting that 1 u = 931.5 MeV/c’, we find 
Equation: 


BE = (0.030378) (931.5 MeV/c?)c” 
= 28.3 MeV. 


Since A = 4, the total binding energy per nucleon is 
Equation: 


BEN = 7.07 MeV/nucleon. 


Significance 

Notice that the binding energy per nucleon for “He is much greater than 
for the hydrogen isotopes (only ~ 3 MeV /nucleon). Therefore, helium 
nuclei cannot break down hydrogen isotopes without energy being put into 
the system. 


Note: 
Exercise: 


Problem: 


Check Your Understanding If the binding energy per nucleon is 
large, does this make it harder or easier to strip off a nucleon from a 
nucleus? 


Solution: 


harder 


Summary 


e The mass defect of a nucleus is the difference between the total mass 
of a nucleus and the sum of the masses of all its constituent nucleons. 

e The binding energy (BE) of a nucleus is equal to the amount of energy 
released in forming the nucleus, or the mass defect multiplied by the 
speed of light squared. 

e A graph of binding energy per nucleon (BEN) versus atomic number A 
implies that nuclei divided or combined release an enormous amount 
of energy. 

e The binding energy of a nucleon in a nucleus is analogous to the 
ionization energy of an electron in an atom. 


Conceptual Questions 


Exercise: 


Problem: 


Explain why a bound system should have less mass than its 
components. Why is this not observed traditionally, say, for a building 
made of bricks? 


Solution: 


A bound system should have less mass than its components because of 
energy-mass equivalence (E = me’). If the energy of a system is 
reduced, the total mass of the system is reduced. If two bricks are 
placed next to one another, the attraction between them is purely 
gravitational, assuming the bricks are electrically neutral. The 
gravitational force between the bricks is relatively small (compared to 
the strong nuclear force), so the mass defect is much too small to be 
observed. If the bricks are glued together with cement, the mass defect 
is likewise small because the electrical interactions between the 
electrons involved in the bonding are still relatively small. 


Exercise: 
Problem: 
Why is the number of neutrons greater than the number of protons in 


stable nuclei that have an A greater than about 40? Why is this effect 
more pronounced for the heaviest nuclei? 


Exercise: 
Problem: 
To obtain the most precise value of the binding energy per nucleon, it 
is important to take into account forces between nucleons at the 


surface of the nucleus. Will surface effects increase or decrease 
estimates of BEN? 


Solution: 


Nucleons at the surface of a nucleus interact with fewer nucleons. This 
reduces the binding energy per nucleon, which is based on an average 
over all the nucleons in the nucleus. 


Problems 


Exercise: 
Problem: 


How much energy would be released if six hydrogen atoms and six 
neutrons were combined to form 12C? 


Solution: 


92.4 MeV 
Exercise: 


Problem: 


Find the mass defect and the binding energy for the helium-4 nucleus. 
Exercise: 

Problem: 

°©F'e is among the most tightly bound of all nuclides. It makes up more 

than 90% of natural iron. Note that °Fe has even numbers of protons 

and neutrons. Calculate the binding energy per nucleon for °°Fe and 


compare it with the approximate value obtained from the graph in 
[link]. 


Solution: 


8.790 MeV & graph’s value 


Exercise: 


Problem: 


209Bi is the heaviest stable nuclide, and its BEN is low compared with 
medium-mass nuclides. Calculate BEN for this nucleus and compare it 
with the approximate value obtained from the graph in [link]. 


Exercise: 
Problem: 
(a) Calculate BEN for 7°°U, the rarer of the two most common 


uranium isotopes; (b) Calculate BEN for 7°°U. (Most of uranium is 
217) 


Solution: 


a. 7.570 MeV; b. 7.591 MeV * graph’s value 
Exercise: 


Problem: 


The fact that BEN peaks at roughly A = 60 implies that the range of 
the strong nuclear force is about the diameter of this nucleus. 


(a) Calculate the diameter of A = 60 nucleus. 


(b) Compare BEN for °°Ni and °°Sr. The first is one of the most 
tightly bound nuclides, whereas the second is larger and less tightly 
bound. 


Glossary 
binding energy (BE) 
energy needed to break a nucleus into its constituent protons and 


neutrons 


binding energy per nucleon (BEN) 
energy need to remove a nucleon from a nucleus 


mass defect 
difference between the mass of a nucleus and the total mass of its 
constituent nucleons 


Radioactive Decay 
By the end of this section, you will be able to: 


e Describe the decay of a radioactive substance in terms of its decay 
constant and half-life 

e Use the radioactive decay law to estimate the age of a substance 

e Explain the natural processes that allow the dating of living tissue 
using !4C 


In 1896, Antoine Becquerel discovered that a uranium-rich rock emits 
invisible rays that can darken a photographic plate in an enclosed container. 
Scientists offer three arguments for the nuclear origin of these rays. First, 
the effects of the radiation do not vary with chemical state; that is, whether 
the emitting material is in the form of an element or compound. Second, the 
radiation does not vary with changes in temperature or pressure—both 
factors that in sufficient degree can affect electrons in an atom. Third, the 
very large energy of the invisible rays (up to hundreds of eV) is not 
consistent with atomic electron transitions (only a few eV). Today, this 
radiation is explained by the conversion of mass into energy deep within the 
nucleus of an atom. The spontaneous emission of radiation from nuclei is 
called nuclear radioactivity ({link]). 


The international ionizing 
radiation symbol is universally 


recognized as the warning 
symbol for nuclear radiation. 


Radioactive Decay Law 


When an individual nucleus transforms into another with the emission of 
radiation, the nucleus is said to decay. Radioactive decay occurs for all 
nuclei with Z > 82, and also for some unstable isotopes with Z < 83. The 
decay rate is proportional to the number of original (undecayed) nuclei N in 
a substance. The number of nuclei lost to decay, —dN in time interval dt, is 
written 


Note: 
Equation: 


where A is called the decay constant. (The minus sign indicates the number 
of original nuclei decreases over time.) In other words, the more nuclei 
available to decay, the more that do decay (in time dt). This equation can be 
rewritten as 

Equation: 


dN 
— = —Adt. 
N 


Integrating both sides of the equation, and defining No to be the number of 
nuclei at t = 0, we obtain 
Equation: 


This gives us 
Equation: 


N 
In — = —At. 
No 


Taking the left and right sides of the equation as a power of e, we have the 
radioactive decay law. 


Note: 

Radioactive Decay Law 

The total number N of radioactive nuclei remaining after time t is 
Equation: 


N =Noe™ 


where A is the decay constant for the particular nucleus. 


The total number of nuclei drops very rapidly at first, and then more slowly 
([link]). 


A plot of the radioactive decay law demonstrates that the 
number of nuclei remaining in a decay sample drops 
dramatically during the first moments of decay. 


The half-life (Ty /2) of a radioactive substance is defined as the time for 
half of the original nuclei to decay (or the time at which half of the original 
nuclei remain). The half-lives of unstable isotopes are shown in the chart of 
nuclides in [link]. The number of radioactive nuclei remaining after an 
integer (n) number of half-lives is therefore 

Equation: 


eae 
Qn 


If the decay constant (A) is large, the half-life is small, and vice versa. To 
determine the relationship between these quantities, note that when 
t=) /2, then N = No/2. Thus, [link] can be rewritten as 

Equation: 


“ — Noe 22. 


Dividing both sides by No and taking the natural logarithm yields 
Equation: 


In 2 — Ine 
2 
which reduces to 
Note: 
Equation: 
0.693 
—— : 
T 1/2 


Thus, if we know the half-life T,/> of a radioactive substance, we can find 


its decay constant. The lifetime T of a radioactive substance is defined as 
the average amount of time that a nucleus exists before decaying. The 
lifetime of a substance is just the reciprocal of the decay constant, written as 


Note: 
Equation: 


>|R 


The activity A is defined as the magnitude of the decay rate, or 
Equation: 


A=—-—— =AN=AMe™. 


The infinitesimal change dN in the time interval dt is negative because the 
number of parent (undecayed) particles is decreasing, so the activity (A) is 
positive. Defining the initial activity as Ang = ANo, we have 


Note: 
Equation: 


A= Ane 


Thus, the activity A of a radioactive substance decreases exponentially with 
time ([link]). 


-[ 


(a) (b) 


(a) A plot of the activity as a function of time (b) If we measure the 
activity at different times, we can plot In A versus t, and obtain a 
straight line. 


Example: 

Decay Constant and Activity of Strontium-90 

The half-life of strontium-90, 33Sr, is 28.8 y. Find (a) its decay constant 
and (b) the initial activity of 1.00 g of the material. 

Strategy 

We can find the decay constant directly from [link]. To determine the 
activity, we first need to find the number of nuclei present. 

Solution 


a. The decay constant is found to be 
Equation: 


0.693 0.693 1 
La - ( ) (= =i <i a 
T1/2 Ti/2 3.16 x 10's 


b. The atomic mass of 3°Sr is 89.91 g. Using Avogadro’s number 
Na = 6.022 x 10?° atoms/mol, we find the initial number of nuclei 
in 1.00 g of the material: 
Equation: 
1.00 ¢g 


N, = N,) =6.70 x 107! nuclei. 
: 30.91 ¢ | 4) 


From this, we find that the activity Ag at ¢ = 0 for 1.00 g of 
strontium-90 is 
Equation: 
Ag =ANo 
= (7.61 x 10°" s~!)(6.70 x 10'nuclei) 
= 5.10 x 10” decays/s. 


Expressing A in terms of the half-life of the substance, we get 
Equation: 


A = Age (0-698/T1/2)Ti2 ae Age © = Ao /2. 
Therefore, the activity is halved after one half-life. We can determine the 


decay constant A by measuring the activity as a function of time. Taking the 
natural logarithm of the left and right sides of [link], we get 


Note: 
Equation: 


In A = —At+I1n Ap. 


This equation follows the linear form y = mz + b. If we plot In A versus t, 
we expect a straight line with slope —A and y-intercept In Ag ([link](b)). 
Activity A is expressed in units of becquerels (Bq), where one 

1 Bq = 1 decay per second. This quantity can also be expressed in decays 
per minute or decays per year. One of the most common units for activity is 
the curie (Ci), defined to be the activity of 1 g of 2*°Ra. The relationship 
between the Bq and Ci is 

Equation: 


1 Ci = 3.70 x 10’°Bgq. 


Example: 

What is '*C Activity in Living Tissue? 

Approximately 20% of the human body by mass is carbon. Calculate the 
activity due to *C in 1.00 kg of carbon found in a living organism. 
Express the activity in units of Bq and Ci. 

Strategy 

The activity of *C is determined using the equation Ag = A.No, where A 
is the decay constant and Np is the number of radioactive nuclei. The 
number of !*C nuclei in a 1.00-kg sample is determined in two steps. First, 
we determine the number of !?C nuclei using the concept of a mole. 
Second, we multiply this value by 1.3 x Os (the known abundance of 
14C in a carbon sample from a living organism) to determine the number 
of *C nuclei ina living organism. The decay constant is determined from 
the known half-life of 14C (available from [link]). 

Solution 

One mole of carbon has a mass of 12.0 g, since it is nearly pure !?C. Thus, 
the number of carbon nuclei in a kilogram is 

Equation: 


6.02 x 107° mol! 


12) _ 
eo 12.0 g/mol 


x (1000 g) = 5.02 x 10”. 


The number of “C nuclei in 1 kg of carbon is therefore 


Equation: 


Nitta) (O02ecdUe le se a l0mey 6520 10.2 


Now we can find the activity A by using the equation A = ae “ 
Entering known values gives us 
Equation: 
0.693 (6.52 x 10%) ae 
= —_.—_ = 7.89 x 10 y 


5730 y 


or 7.89 x 10° decays per year. To convert this to the unit Bq, we simply 
convert years to seconds. Thus, 
Equation: 


1.00 y 


A780 <0). ) 
( 4 are x 107s 


= 250 Bq, 


or 250 decays per second. To express A in curies, we use the definition of a 
curie, 


Equation: 
250 B 
= ee = 6.76 x 107° Ci. 
3.7 x 10'° Bq/Ci 
Thus, 
Equation: 
A = 6.76 nCi. 
Significance 


Approximately 20% of the human body by weight is carbon. Hundreds of 
14C decays take place in the human body every second. Carbon-14 and 
other naturally occurring radioactive substances in the body compose a 
person’s background exposure to nuclear radiation. As we will see later in 
this chapter, this activity level is well below the maximum recommended 
dosages. 


Radioactive Dating 


Radioactive dating is a technique that uses naturally occurring 
radioactivity to determine the age of a material, such as a rock or an ancient 
artifact. The basic approach is to estimate the original number of nuclei in a 
material and the present number of nuclei in the material (after decay), and 
then use the known value of the decay constant A and [link] to calculate the 
total time of the decay, t. 


An important method of radioactive dating is carbon-14 dating. Carbon-14 
nuclei are produced when high-energy solar radiation strikes ‘4N nuclei in 
the upper atmosphere and subsequently decay with a half-life of 5730 years. 
Radioactive carbon has the same chemistry as stable carbon, so it combines 
with the ecosphere and eventually becomes part of every living organism. 
Carbon-14 has an abundance of 1.3 parts per trillion of normal carbon. 
Therefore, if you know the number of carbon nuclei in an object, you 
multiply that number by 1.3 x 10~/* to find the number of !4C nuclei in 
that object. When an organism dies, carbon exchange with the environment 
ceases, and !4C is not replenished as it decays. 


By comparing the abundance of !4C in an artifact, such as mummy 
wrappings, with the normal abundance in living tissue, it is possible to 
determine the mummy’s age (or the time since the person’s death). Carbon- 
14 dating can be used for biological tissues as old as 50,000 years, but is 
generally most accurate for younger samples, since the abundance of '4C 
nuclei in them is greater. Very old biological materials contain no '4C at all. 
The validity of carbon dating can be checked by other means, such as by 
historical knowledge or by tree-ring counting. 


Example: 

An Ancient Burial Cave 

In an ancient burial cave, your team of archaeologists discovers ancient 
wood furniture. Only 80% of the original ‘“C remains in the wood. How 
old is the furniture? 

Strategy 


The problem statement implies that NV /No = 0.80. Therefore, the 
equation N = Noe~~“ can be used to find the product, At. We know the 
half-life of !4C is 5730 y, so we also know the decay constant, and 
therefore the total decay time t. 

Solution 

Solving the equation N = Noe~* for N/ No gives us 

Equation: 


N 
Ee 
No 
Thus, 
Equation: 
0.80=e . 


Taking the natural logarithm of both sides of the equation yields 
Equation: 


In 0.80 = —At, 
so that 
Equation: 

—0.223 = —At. 
Rearranging the equation to isolate t gives us 
Equation: 

0.223 
a Fak, 
where 
Equation: 
ee 0.693 0.693 
ti = BT30y" 


Combining this information yields 


Equation: 


2 
feces = 1844 y. 


0.693 
5730 y 
Significance 


The furniture is almost 2000 years old—an impressive discovery. The 
typical uncertainty on carbon-14 dating is about 5%, so the furniture is 
anywhere between 1750 and 1950 years old. This date range must be 
confirmed by other evidence, such as historical records. 


Note: 
Exercise: 


Problem: 


Check Your Understanding A radioactive nuclide has a high decay 
rate. What does this mean for its half-life and activity? 


Solution: 


Half-life is inversely related to decay rate, so the half-life is short. 
Activity depends on both the number of decaying particles and the 
decay rate, so the activity can be great or small. 


Note: 
Visit the Radioactive Dating Game to learn about the types of radiometric 
dating and try your hand at dating some ancient objects. 


Summary 


e In the decay of a radioactive substance, if the decay constant (A) is 
large, the half-life is small, and vice versa. 

¢ The radioactive decay law, N = Noe, uses the properties of 
radioactive substances to estimate the age of a substance. 

e Radioactive carbon has the same chemistry as stable carbon, so it 
mixes into the ecosphere and eventually becomes part of every living 
organism. By comparing the abundance of ‘4C in an artifact with the 
normal abundance in living tissue, it is possible to determine the 
artifact’s age. 


Conceptual Questions 


Exercise: 


Problem: 


How is the initial activity rate of a radioactive substance related to its 
half-life? 


Exercise: 


Problem: 


For the carbon dating described in this chapter, what important 
assumption is made about the time variation in the intensity of cosmic 
rays? 


Solution: 


That it is constant. 


Problems 


Exercise: 


Problem: 


A sample of radioactive material is obtained from a very old rock. A 
plot InA verses t yields a slope value of —10-°s~? (see [link](b)). 
What is the half-life of this material? 


Solution: 


The decay constant is equal to the negative value of the slope or 
10~°s~!. The half-life of the nuclei, and thus the material, is 
T} /2 = 693 million years. 


Exercise: 


Problem: Show that: 7’ = = 


Exercise: 


Problem: 


The half-life of strontium-91, 3!Sr is 9.70 h. Find (a) its decay 


constant and (b) for an initial 1.00-g sample, the activity after 15 
hours. 


Solution: 


a. The decay constant is \ = 1.99 x 10-°s 1. b. Since strontium-91 
has an atomic mass of 90.90 g, the number of nuclei in a 1.00-g sample 
is initially 
No = 6.63 x 107 nuclei. 
The initial activity for strontium-91 is 
— 1.32 x 10!” decays/s 
The activity att = 15.0h = 5.40 x 10*sis 
A=4.51 x 10’ decays/s. 


Exercise: 


Problem: 


A sample of pure carbon-14 (T; /2 = 9730 y) has an activity of 
1.0 yz Ci. What is the mass of the sample? 


Exercise: 
Problem: 
A radioactive sample initially contains 2.40 x 10°? mol ofa 


radioactive material whose half-life is 6.00 h. How many moles of the 
radioactive material remain after 6.00 h? After 12.0 h? After 36.0 h? 


Solution: 


1.20 x 10°? mol; 6.00 x 10-°mol; 3.75 x 10 “mol 
Exercise: 

Problem: 

An old campfire is uncovered during an archaeological dig. Its 

charcoal is found to contain less than 1/1000 the normal amount of 


4C. Estimate the minimum age of the charcoal, noting that 
2 = 1024. 


Exercise: 
Problem: 
Calculate the activity R, in curies of 1.00 g of ?”°Ra. (b) Explain why 


your answer is not exactly 1.00 Ci, given that the curie was originally 
supposed to be exactly the activity of a gram of radium. 


Solution: 


a. 0.988 Ci; b. The half-life of ?“°Ra is more precisely known than it 
was when the Ci unit was established. 


Exercise: 


Problem: 


Natural uranium consists of 72°U (percent abundance = 0.7200%, 
= 3.12 x 10°17/s) and 23°U (percent abundance = 99.27%, 
d = 4.92 x 10~18/s). What were the values for percent abundance 
of 7°°U and 7°8U when Earth formed 4.5 x 10° years ago? 


Exercise: 


Problem: 


World War II aircraft had instruments with glowing radium-painted 
dials. The activity of one such instrument was 1.0 x 10° Bq when 
new. (a) What mass of 22°Ra was present? (b) After some years, the 
phosphors on the dials deteriorated chemically, but the radium did not 
escape. What is the activity of this instrument 57.0 years after it was 
made? 


Solution: 


a. 2.73ug; b. 9.76 x 104 Bq 
Exercise: 


Problem: 


The 2!°Po source used in a physics laboratory is labeled as having an 
activity of 1.0 Ci on the date it was prepared. A student measures the 
radioactivity of this source with a Geiger counter and observes 1500 
counts per minute. She notices that the source was prepared 120 days 
before her lab. What fraction of the decays is she observing with her 
apparatus? 


Exercise: 


Problem: 


Armor-piercing shells with depleted uranium cores are fired by aircraft 
at tanks. (The high density of the uranium makes them effective.) The 
uranium is called depleted because it has had its 22°U removed for 
reactor use and is nearly pure 7°°U. Depleted uranium has been 
erroneously called nonradioactive. To demonstrate that this is wrong: 
(a) Calculate the activity of 60.0 g of pure 2°°U. (b) Calculate the 
activity of 60.0 g of natural uranium, neglecting the 7°4U and all 
daughter nuclides. 


Solution: 


a. 7.46 x 10° Bq;b.7.75 x 10° Bq 


Glossary 


activity 
magnitude of the decay rate for radioactive nuclides 


becquerel (Bq) 
SI unit for the decay rate of a radioactive material, equal to 1 
decay/second 


carbon-14 dating 
method to determine the age of formerly living tissue using the ratio 


st Or a ©: 


curie (Ci) 
unit of decay rate, or the activity of 1 g of ?#°Ra, equal to 
3.70 x 10/° Bq 


decay 
process by which an individual atomic nucleus of an unstable atom 


loses mass and energy by emitting ionizing particles 


decay constant 


quantity that is inversely proportional to the half-life and that is used in 


equation for number of nuclei as a function of time 


half-life 
time for half of the original nuclei to decay (or half of the original 
nuclei remain) 


lifetime 
average time that a nucleus exists before decaying 


radioactive dating 
application of radioactive decay in which the age of a material is 
determined by the amount of radioactivity of a particular type that 
occurs 


radioactive decay law 
describes the exponential decrease of parent nuclei in a radioactive 
sample 


radioactivity 
spontaneous emission of radiation from nuclei 


Nuclear Reactions 
By the end of this section, you will be able to: 


e Describe and compare three types of nuclear radiation 

e Use nuclear symbols to describe changes that occur during nuclear 
reactions 

e Describe processes involved in the decay series of heavy elements 


Early experiments revealed three types of nuclear “rays” or radiation: alpha 
(a) rays, beta (() rays, and gamma (7) rays. These three types of 
radiation are differentiated by their ability to penetrate matter. Alpha 
radiation is barely able to pass through a thin sheet of paper. Beta radiation 
can penetrate aluminum to a depth of about 3 mm, and gamma radiation can 
penetrate lead to a depth of 2 or more centimeters ((Link]). 


Alpha 8 —>> , 


Beta @ 


Gamma 


Paper Metal Concrete Lead 


A comparison of the penetration depths of alpha (q@), beta (@), and 
gamma (7) radiation through various materials. 


The electrical properties of these three types of radiation are investigated by 
passing them through a uniform magnetic field, as shown in [link]. 


According to the magnetic force equation F = gv x B, positively charged 


particles are deflected upward, negatively charged particles are deflected 
downward, and particles with no charge pass through the magnetic field 
undeflected. Eventually, a rays were identified with helium nuclei (*He), 8 
rays with electrons and positrons (positively charged electrons or 
antielectrons), and y rays with high-energy photons. We discuss alpha, 
beta, and gamma radiation in detail in the remainder of this section. 


The effect of a magnetic field on alpha (a), beta (G), and gamma (7) 
radiation. This figure is a schematic only. The relative paths of the 
particles depend on their masses and initial kinetic energies. 


Alpha Decay 


Heavy unstable nuclei emit a@ radiation. In a-particle decay (or alpha 
decay), the nucleus loses two protons and two neutrons, so the atomic 
number decreases by two, whereas its mass number decreases by four. 
Before the decay, the nucleus is called the parent nucleus. The nucleus or 


nuclei produced in the decay are referred to as the daughter nucleus or 
daughter nuclei. We represent an a decay symbolically by 


Note: 
Equation: 


(Dk S pk + SE 


where #X is the parent nucleus, #~4X is the daughter nucleus, and $He is 


the a particle. In a decay, a nucleus of atomic number Z decays into a 
nucleus of atomic number Z — 2 and atomic mass A — 4. Interestingly, the 
dream of the ancient alchemists to turn other metals into gold is 
scientifically feasible through the alpha-decay process. The efforts of the 
alchemists failed because they relied on chemical interactions rather than 
nuclear interactions. 


Note: 

Watch alpha particles escape from a polonium nucleus, causing radioactive 
alpha decay. See how random decay times relate to the half-life. To try a 
simulation of alpha decay, visit alpha particles 


An example of alpha decay is uranium-238: 
Equation: 


2 234 4 
3U — 735X + 5He. 


The atomic number has dropped from 92 to 90. The chemical element with 
Z = 90 is thorium. Hence, Uranium-238 has decayed to Thorium-234 by 


the emission of an @ particle, written 
Equation: 


238 234 4 


Subsequently, fae decays by 6 emission with a half-life of 24 days. The 


energy released in this alpha decay takes the form of kinetic energies of the 
thorium and helium nuclei, although the kinetic energy of thorium is 
smaller than helium due to its heavier mass and smaller velocity. 


Example: 

Plutonium Alpha Decay 

Find the energy emitted in the a decay of 7?9Pu. 

Strategy 

The energy emitted in the a decay of 7°?Pu can be found using the 
equation FE = (Am)c?. We must first find Am, the difference in mass 
between the parent nucleus and the products of the decay. 

Solution 

The decay equation is 

Equation: 


29Py — U + “He. 


Thus, the pertinent masses are those of 239Py, 235U and the a particle or 
“He, all of which are known. The initial mass was 
m (22Pu) = 239.052157 u. The final mass is the sum 


Equation: 
m (?U) +m (4He) = 235.043924 u + 4.002602 u 
= 239.046526 u. 


Thus, 
Equation: 


Am =m (7°Pu) — [m (Ga) +m (*He) | 
= 239.052157 u — 239.046526 u 
= 0.0005631 u. 


Now we can find E by entering Am into the equation: 
Equation: 


E = (Am)c? = (0.005631 u)c?. 


We know 1 u = 931.5 MeV/c?, so we have 
Equation: 


E = (0.005631) (931.5 MeV/c’) (c’) 
= 5.25 MeV. 


Significance 

The energy released in this a decay is in the MeV range, many times 
greater than chemical reaction energies. Most of this energy becomes 
kinetic energy of the a particle (or “He nucleus), which moves away at 
high speed. The energy carried away by the recoil of the 7°°U nucleus is 
much smaller due to its relatively large mass. The 7°°U nucleus can be left 
in an excited state to later emit photons (7¥ rays). 


Beta Decay 


In most @ particle decays (or beta decay), either an electron (G" ) or 
positron (G7) is emitted by a nucleus. A positron has the same mass as the 
electron, but its charge is +e. For this reason, a positron is sometimes 
called an antielectron. How does £ decay occur? A possible explanation is 
the electron (positron) is confined to the nucleus prior to the decay and 
somehow escapes. To obtain a rough estimate of the escape energy, consider 
a simplified model of an electron trapped in a box (or in the terminology of 
quantum mechanics, a one-dimensional square well) that has the width of a 
typical nucleus (10-‘*m). According to the Heisenberg uncertainty 


principle in Quantum Mechanics, the uncertainty of the momentum of the 
electron is: 
Equation: 


6.6 x 10° m?-k 
= ee ee ie m/s. 


h 
A ae 
P? Az 10-!4m 


Taking this momentum value (an underestimate) to be the “true value,” the 
kinetic energy of the electron on escape is approximately 
Equation: 


Ap)” 6.6 x 10° kg- m/s)” 
(Ap)” _ (6.6 x 10° kg - m/s)" = 2.0 x 10°-°J = 12,400 MeV. 
2me 2(9.1 x 10°*'kg) 


Experimentally, the electrons emitted in G~ decay are found to have kinetic 
energies of the order of only a few MeV. We therefore conclude that the 
electron is somehow produced in the decay rather than escaping the 
nucleus. Particle production (annihilation) is described by theories that 
combine quantum mechanics and relativity, a subject of a more advanced 
course in physics. 


Nuclear beta decay involves the conversion of one nucleon into another. For 
example, a neutron can decay to a proton by the emission of an electron ( 
@~) and a nearly massless particle called an antineutrino (Vv): 

Equation: 


pn > p+ e+. 


The notation ‘Ye is used to designate the electron. Its mass number is 0 


because it is not a nucleon, and its atomic number is —1 to signify that it 
has a charge of —e. The proton is represented by iP because its mass 
number and atomic number are 1. When this occurs within an atomic 
nucleus, we have the following equation for beta decay: 


Note: 
Equation: 


ees gai hae eeu. 


As discussed in another chapter, this process occurs due to the weak nuclear 
force. 


Note: 
Watch beta decay occur for a collection of nuclei or for an individual 
nucleus. 


As an example, the isotope te is unstable and decays by G~ emission 


with a half-life of 24 days. Its decay can be represented as 
Equation: 


234 234 0 
oth > “9X + _je + v. 
Since the chemical element with atomic number 91 is protactinium (Pa), we 


can write the G~ decay of thorium as 
Equation: 


“ag Th > alae} + ‘Se Ul. 
The reverse process is also possible: A proton can decay to a neutron by the 


emission of a positron (G*) and a nearly massless particle called a neutrino 
(v). This reaction is written as 7 — on =P fe +v. 


0 
a 


the nucleus. (Like 8” decay, the positron does not precede the decay but is 
produced in the decay.) For an isolated proton, this process is impossible 
because the neutron is heavier than the proton. However, this process is 
possible within the nucleus because the proton can receive energy from 
other nucleons for the transition. As an example, the isotope of aluminum 
“SAl decays by 8+ emission with a half-life of 7.40 x 10°y. The decay is 
written as 

Equation: 


The positron ,;e is emitted with the neutrino v, and the neutron remains in 


26 26 0 

Al > pX+ e+ v. 
The atomic number 12 corresponds to magnesium. Hence, 
Equation: 


‘sAl > 7$Mg + fet v. 


As a nuclear reaction, positron emission can be written as 


Note: 
Equation: 


ax — pans + te ete 


The neutrino was not detected in the early experiments on ( decay. 
However, the laws of energy and momentum seemed to require such a 
particle. Later, neutrinos were detected through their interactions with 
nuclei. 


Example: 
Bismuth Alpha and Beta Decay 
The a Bi nucleus undergoes both a@ and @~ decay. For each case, what is 
the daughter nucleus? 
Strategy 
We can use the processes described by [link] and [link], as well as the 
Periodic Table, to identify the resulting elements. 
Solution 
The atomic number and the mass number for the a particle are 2 and 4, 
respectively. Thus, when a bismuth-211 nucleus emits an a particle, the 
daughter nucleus has an atomic number of 81 and a mass number of 207. 
The element with an atomic number of 81 is thallium, so the decay is given 
by 
Equation: 

eee = emt Slee. 
In GB” decay, the atomic number increases by 1, while the mass number 
stays the same. The element with an atomic number of 84 is polonium, so 


the decay is given by 
Equation: 
75g Bi > *44Po+ e+. 
Note: 
Exercise: 
Problem: 


Check Your Understanding In radioactive beta decay, does the 
atomic mass number, A, increase or decrease? 


Solution: 


Neither; it stays the same. 


Gamma Decay 


A nucleus in an excited state can decay to a lower-level state by the 
emission of a “gamma-ray” photon, and this is known as gamma decay. 
This is analogous to de-excitation of an atomic electron. Gamma decay is 
represented symbolically by 


Note: 
Equation: 


POS Ee tay 


where the asterisk (*) on the nucleus indicates an excited state. In y decay, 
neither the atomic number nor the mass number changes, so the type of 
nucleus does not change. 


Radioactive Decay Series 


Nuclei with Z > 82 are unstable and decay naturally. Many of these nuclei 
have very short lifetimes, so they are not found in nature. Notable 
exceptions include iw (or Th-232) with a half-life of 1.39 x 10° 
years, and 738) (or U-238) with a half-life of 7.04 x 10° years. When a 
heavy nucleus decays to a lighter one, the lighter daughter nucleus can 
become the parent nucleus for the next decay, and so on. This process can 
produce a long series of nuclear decays called a decay series. The series 
ends with a stable nucleus. 


To illustrate the concept of a decay series, consider the decay of Th-232 
series ({link]). The neutron number, N, is plotted on the vertical y-axis, and 
the atomic number, Z, is plotted on the horizontal x-axis, so Th-232 is found 


at the coordinates (N, Z) = (142, 90). Th-232 decays by a emission with 
a half-life of 1.39 x 101° years. Alpha decay decreases the atomic number 
by 2 and the mass number by 4, so we have 

Equation: 


232 228 4 
99 Lh — “g3Ra + 5He. 


The neutron number for Radium-228 is 140, so it is found in the diagram at 
the coordinates (NV, Z) = (140, 88). Radium-228 is also unstable and 
decays by 6” emission with a half-life of 5.76 years to Actinum-228. The 
atomic number increases by 1, the mass number remains the same, and the 
neutron number decreases by 1. Notice that in the graph, a emission 
appears as a line sloping downward to the left, with both N and Z 
decreasing by 2. Beta emission, on the other hand, appears as a line sloping 
downward to the right with N decreasing by 1, and Z increasing by 1. After 
several additional alpha and beta decays, the series ends with the stable 
nucleus Pb-208. 


The relative frequency of different types of radioactive decays (alpha, beta, 
and gamma) depends on many factors, including the strength of the forces 
involved and the number of ways a given reaction can occur without 
violating the conservation of energy and momentum. How often a 
radioactive decay occurs often depends on a sensitive balance of the strong 
and electromagnetic forces. These forces are discussed in Particle Physics 
and Cosmology. 
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In the thorium ral decay series, alpha (a) decays reduce the atomic 


number, as indicated by the red arrows. Beta (G ) decays increase the 
atomic number, as indicated by the blue arrows. The series ends at the 
stable nucleus Pb-208. 


As another example, consider the U-238 decay series shown in [link]. After 
numerous alpha and beta decays, the series ends with the stable nucleus Pb- 
206. An example of a decay whose parent nucleus no longer exists naturally 
is shown in [link]. It starts with Neptunium-237 and ends in the stable 
nucleus Bismuth-209. Neptunium is called a transuranic element because 
it lies beyond uranium in the periodic table. Uranium has the highest atomic 


number (Z = 92) of any element found in nature. Elements with Z > 92 
can be produced only in the laboratory. They most probably also existed in 
nature at the time of the formation of Earth, but because of their relatively 
short lifetimes, they have completely decayed. There is nothing 
fundamentally different between naturally occurring and artificial elements. 
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In the Uranium-238 decay series, alpha (a) decays reduce the atomic 
number, as indicated by the red arrows. Beta (G~ ) decays increase the 
atomic number, as indicated by the blue arrows. The series ends at the 


stable nucleus Pb-206. 


Notice that for Bi (21), the decay may proceed through either alpha or beta 
decay. 
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In the Neptunium-237 decay series, alpha (a) decays reduce the 
atomic number, as indicated by the red arrows. Beta (~ ) decays 
increase the atomic number, as indicated by the blue arrows. The series 
ends at the stable nucleus Bi-209. 


Radioactivity in the Earth 


According to geologists, if there were no heat source, Earth should have 
cooled to its present temperature in no more than 1 billion years. Yet, Earth 
is more than 4 billion years old. Why is Earth cooling so slowly? The 
answer is nuclear radioactivity, that is, high-energy particles produced in 
radioactive decays heat Earth from the inside ([link]). 


t 


Radiation XY 


Earth is heated by nuclear reactions (alpha, beta, and gamma decays). 
Without these reactions, Earth’s surface would be much cooler than it 
is now. 


Candidate nuclei for this heating model are 228U and “°K, which possess 
half-lives similar to or longer than the age of Earth. The energy produced by 
these decays (per second per cubic meter) is small, but the energy cannot 
escape easily, so Earth’s core is very hot. Thermal energy in Earth’s core is 
transferred to Earth’s surface and away from it through the processes of 
convection, conduction, and radiation. 


Summary 


e The three types of nuclear radiation are alpha (a) rays, beta (9) rays, 
and gamma (7) rays. 
¢ We represent a: decay symbolically by 4X —> eX + 5He. There are 


two types of 8 decay: either an electron (8~ ) or a positron (8°) is 
emitted by a nucleus. yy decay is represented symbolically by 
oX* 3X +74. 

e When a heavy nucleus decays to a lighter one, the lighter daughter 
nucleus can become the parent nucleus for the next decay, and so on, 
producing a decay series. 


Conceptual Questions 


Exercise: 
Problem: 
What is the key difference and the key similarity between beta (37 ) 
decay and alpha decay? 
Exercise: 
Problem: 


What is the difference between ¥ rays and characteristic X-rays and 
visible light? 


Solution: 


Gamma (y) rays are produced by nuclear interactions and X-rays and 
light are produced by atomic interactions. Gamma rays are typically 
shorter wavelength than X-rays, and X-rays are shorter wavelength 
than light. 


Exercise: 
Problem: 
What characteristics of radioactivity show it to be nuclear in origin and 
not atomic? 
Exercise: 
Problem: 
Consider [link]. If the magnetic field is replaced by an electric field 


pointed in toward the page, in which directions will the a-, 6*-, and 7 
rays bend? 


Solution: 


Assume a rectangular coordinate system with an xy-plane that 
corresponds to the plane of the paper. a bends into the page (trajectory 
parabolic in the xz-plane); 8* bends into the page (trajectory parabolic 
in the xz-plane); and + is unbent. 


Exercise: 


Problem: Why is Earth’s core molten? 


Problems 


Exercise: 


Problem: 


249 Cf undergoes alpha decay. (a) Write the reaction equation. (b) Find 
the energy released in the decay. 


Exercise: 
Problem: 
(a) Calculate the energy released in the a decay of 7°°U. (b) What 
fraction of the mass of a single 7°°U is destroyed in the decay? The 
mass of 2°*Th is 234.043593 u. (c) Although the fractional mass loss 


is large for a single nucleus, it is difficult to observe for an entire 
macroscopic sample of uranium. Why is this? 


Solution: 


a. 4.273 MeV; b. 1.927 x 10~°; c. Since 7°°U is a slowly decaying 
substance, only a very small number of nuclei decay on human 
timescales; therefore, although those nuclei that decay lose a 
noticeable fraction of their mass, the change in the total mass of the 
sample is not detectable for a macroscopic sample. 


Exercise: 
Problem: 
The §~ particles emitted in the decay of 3H (tritium) interact with 
matter to create light in a glow-in-the-dark exit sign. At the time of 


manufacture, such a sign contains 15.0 Ci of 2H. (a) What is the mass 
of the tritium? (b) What is its activity 5.00 y after manufacture? 


Exercise: 


Problem: 


(a) Write the complete 8~ decay equation for °°Sr, a major waste 
product of nuclear reactors. (b) Find the energy released in the decay. 


Solution: 


a. 28Sr52 > 2051 + 8! + ve; b. 0.546 MeV 


Exercise: 


Problem: 


Write a nuclear §~ decay reaction that produces the 9°Y nucleus. 
(Hint: The parent nuclide is a major waste product of reactors and has 
chemistry similar to calcium, so that it is concentrated in bones if 
ingested.) 


Exercise: 


Problem: 


Write the complete decay equation in the complete pee N notation for 
the beta (3) decay of *H (tritium), a manufactured isotope of 
hydrogen used in some digital watch displays, and manufactured 
primarily for use in hydrogen bombs. 


Solution: 


iH. — $He; + B + ve 
Exercise: 
Problem: 
If a 1.50-cm-thick piece of lead can absorb 90.0% of the rays from a 


radioactive source, how many centimeters of lead are needed to absorb 
all but 0.100% of the rays? 


Exercise: 


Problem: 


An electron can interact with a nucleus through the beta-decay 
process: 


aXte° 39 Y+u. 
(a) Write the complete reaction equation for electron capture by ‘Be. 


(b) Calculate the energy released. 


Solution: 


a. ;Bes +e —> SLi4g + v¢; b. 0.862 MeV 
Exercise: 
Problem: 


(a) Write the complete reaction equation for electron capture by /°O. 


(b) Calculate the energy released. 
Exercise: 
Problem: 
A rare decay mode has been observed in which 72?Ra emits a ‘*C 


nucleus. (a) The decay equation is ?27Ra > 4X + 14C. Identify the 


nuclide 4X. (b) Find the energy emitted in the decay. The mass of 
222Ra is 222.015353 u. 


Solution: 


a. X = 738Pby96; b. 33.05 MeV 


Glossary 


alpha decay 
radioactive nuclear decay associated with the emission of an alpha 
particle 


alpha (@) rays 
one of the types of rays emitted from the nucleus of an atom as alpha 
particles 


antielectrons 
another term for positrons 


antineutrino 


antiparticle of an electron’s neutrino in G~ decay 


beta decay 
radioactive nuclear decay associated with the emission of a beta 
particle 


beta (8) rays 
one of the types of rays emitted from the nucleus of an atom as beta 
particles 


daughter nucleus 
nucleus produced by the decay of a parent nucleus 


decay series 
series of nuclear decays ending in a stable nucleus 


gamma decay 
radioactive nuclear decay associated with the emission of gamma 
radiation 


gamma (7) rays 
one of the types of rays emitted from the nucleus of an atom as gamma 
particles 


neutrino 
subatomic elementary particle which has no net electric charge 


parent nucleus 
original nucleus before decay 


positron 
electron with positive charge 


transuranic element 
element that lies beyond uranium in the periodic table 


Fission 
By the end of this section, you will be able to: 


e Describe the process of nuclear fission in terms of its product and reactants 

e Calculate the energies of particles produced by a fission reaction 

e Explain the fission concept in the context of fission bombs and nuclear 
reactions 


In 1934, Enrico Fermi bombarded chemical elements with neutrons in order to 
create isotopes of other elements. He assumed that bombarding uranium with 
neutrons would make it unstable and produce a new element. Unfortunately, Fermi 
could not determine the products of the reaction. Several years later, Otto Hahn 
and Fritz Strassman reproduced these experiments and discovered that the 
products of these reactions were smaller nuclei. From this, they concluded that the 
uranium nucleus had split into two smaller nuclei. 


The splitting of a nucleus is called fission. Interestingly, U-235 fission does not 
always produce the same fragments. Example fission reactions include: 
Equation: 


in +733U > Ba + 2Kr + 34n+Q, 


jn + 2350 — 10 XKe + 2Sr + 2)n+Q, 
gn + 788U — sSn + '3Mo+ 35n+Q. 


In each case, the sum of the masses of the product nuclei are less than the masses 
of the reactants, so the fission of uranium is an exothermic process (Q > 0). This 
is the idea behind the use of fission reactors as sources of energy ([link]). The 
energy carried away by the reaction takes the form of particles with kinetic energy. 
The percent yield of fragments from a U-235 fission is given in [link]. 


The Phillipsburg Nuclear Power Plant in Germany uses a fission reactor to 
generate electricity. 
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In this graph of fission fragments from U- 

235, the peaks in the graph indicate nuclei 

that are produced in the greatest abundance 
by the fission process. 


Energy changes in a nuclear fission reaction can be understood in terms of the 
binding energy per nucleon curve ((link]). The BEN value for uranium (A = 236) 
is slightly lower than its daughter nuclei, which lie closer to the iron (Fe) peak. 
This means that nucleons in the nuclear fragments are more tightly bound than 
those in the U-235 nucleus. Therefore, a fission reaction results in a drop in the 
average energy of a nucleon. This energy is carried away by high-energy neutrons. 


Niels Bohr and John Wheeler developed the liquid drop model to understand the 
fission process. According to this model, firing a neutron at a nucleus is analogous 
to disturbing a droplet of water ((link]). The analogy works because short-range 
forces between nucleons in a nucleus are similar to the attractive forces between 


water molecules in a water droplet. In particular, forces between nucleons at the 
surface of the nucleus result in a surface tension similar to that of a water droplet. 
A neutron fired into a uranium nucleus can set the nucleus into vibration. If this 
vibration is violent enough, the nucleus divides into smaller nuclei and also emits 
two or three individual neutrons. 
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In the liquid drop model of nuclear fission, the uranium nucleus is split into 
two lighter nuclei by a high-energy neutron. 


U-235 fission can produce a chain reaction. In a compound consisting of many U- 
235 nuclei, neutrons in the decay of one U-235 nucleus can initiate the fission of 
additional U-235 nuclei ({link]). This chain reaction can proceed in a controlled 
manner, as in a nuclear reactor at a power plant, or proceed uncontrollably, as in an 
explosion. 


Neutron = 288U 
in 


In a U-235 fission chain reaction, the fission of the uranium nucleus produces 
high-energy neutrons that go on to split more nuclei. The energy released in 
this process can be used to produce electricity. 


Note: 


View a simulation on nuclear fission to start a chain reaction, or introduce 
nonradioactive isotopes to prevent one. Control energy production in a nuclear 
reactor. 


The Atomic Bomb 


The possibility of a chain reaction in uranium, with its extremely large energy 
release, led nuclear scientists to conceive of making a bbmb—an atomic bomb. 
(These discoveries were taking place in the years just prior to the Second World 
War and many of the European physicists involved in these discoveries came from 
countries that were being overrun.) Natural uranium contains 99.3% U-238 and 
only 0.7% U-235, and does not produce a chain reaction. To produce a controlled, 
sustainable chain reaction, the percentage of U-235 must be increased to about 
50%. In addition, the uranium sample must be massive enough so a typical 
neutron is more likely to induce fission than it is to escape. The minimum mass 
needed for the chain reaction to occur is called the critical mass. When the critical 
mass reaches a point at which the chain reaction becomes self-sustaining, this is a 
condition known as criticality. The original design required two pieces of U-235 
below the critical mass. When one piece in the form of a bullet is fired into the 
second piece, the critical mass is exceeded and a chain reaction is produced. 


An important obstacle to the U-235 bomb is the production of a critical mass of 
fissionable material. Therefore, scientists developed a plutonium-239 bomb 
because Pu-239 is more fissionable than U-235 and thus requires a smaller critical 
mass. The bomb was made in the form of a sphere with pieces of plutonium, each 
below the critical mass, at the edge of the sphere. A series of chemical explosions 
fired the plutonium pieces toward the center of the sphere simultaneously. When 
all these pieces of plutonium came together, the combination exceeded the critical 
mass and produced a chain reaction. Both the U-235 and Pu-239 bombs were used 
in World War II. Whether to develop and use atomic weapons remain two of the 
most important questions faced by human civilization. 


Example: 

Calculating Energy Released by Fission 

Calculate the energy released in the following rare spontaneous fission reaction: 
Equation: 


381, Sr + Xe + 3n, 


The atomic masses are m (*°°U) = 238.050784 u, m (°°Sr) = 94.919388 u, 
m (1“°Xe) = 139.921610 u, and m (n) = 1.008665 u. 

Strategy 

As always, the energy released is equal to the mass destroyed times c“, sO we 
must find the difference in mass between the parent 7°°U and the fission products. 
Solution 

The products have a total mass of 

Equation: 


2 


Mproducts = 94.919388 u + 139.921610 u + 3 (1.008665 u) 
= 237.866993 u. 


The mass lost is the mass of 2°°5U — Mproducts OF 
Equation: 


Am = 238.050784 u — 237.8669933 u = 0.183791 u. 


Therefore, the energy released is 
Equation: 


c? = 171.2 MeV. 


931.5 MeV/c? 
E = (Am)c? = (0.183791 jae 


Significance 

Several important things arise in this example. The energy release is large but less 
than it would be if the nucleus split into two equal parts, since energy is carried 
away by neutrons. However, this fission reaction produces neutrons and does not 
split the nucleus into two equal parts. Fission of a given nuclide, such as 7°3U, 
does not always produce the same products. Fission is a statistical process in 
which an entire range of products are produced with various probabilities. Most 
fission produces neutrons, although the number varies. This is an extremely 
important aspect of fission, because neutrons can induce more fission, enabling 
self-sustaining chain reactions. 


Fission Nuclear Reactors 


The first nuclear reactor was built by Enrico Fermi on a squash court on the 
campus of the University of Chicago on December 2, 1942. The reactor itself 
contained U-238 enriched with 3.6% U-235. Neutrons produced by the chain 
reaction move too fast to initiate fission reactions. One way to slow them down is 
to enclose the entire reactor in a water bath under high pressure. The neutrons 
collide with the water molecules and are slowed enough to be used in the fission 
process. The slowed neutrons split more U-235 nuclei and a chain reaction occurs. 
The rate at which the chain reaction proceeds is controlled by a series of “control” 
rods made of cadmium inserted into the reactor. Cadmium is capable of absorbing 
a large number of neutrons without becoming unstable. 


A nuclear reactor design, called a pressurized water reactor, can also be used to 
generate electricity ({link]). A pressurized water reactor (on the left in the figure) 
is designed to control the fission of large amounts of 2*°U. The energy released in 
this process is absorbed by water flowing through pipes in the system (the 
“primary loop”) and steam is produced. Cadmium control rods adjust the neutron 
flux (the rate of flow of neutrons passing through the system) and therefore control 
the reaction. In case the reactor overheats and the water boils away, the chain 
reaction terminates, because water is used to thermalize the neutrons. (This safety 
feature can be overwhelmed in extreme circumstances.) The hot, high-pressure 
water then passes through a pipe to a second tank of water at normal pressure in 
the steam generator. The steam produced at one end of the steam generator fills a 
chamber that contains a turbine. This steam is at a very high pressure. Meanwhile, 
a steam condenser connected to the other side of the turbine chamber maintains 
steam at low pressure. The pressure differences force steam through the chamber, 
which turns the turbine. The turbine, in turn, powers an electric generator. 
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A nuclear reactor uses the energy produced in the fission of U-235 to produce 
electricity. Energy from a nuclear fission reaction produces hot, high-pressure 
steam that turns a turbine. As the turbine turns, electricity is produced. 


The major drawback to a fission reactor is nuclear waste. U-235 fission produces 
nuclei with long half-lives such as 2°°U that must be stored. These products cannot 
be dumped into oceans or left in any place where they will contaminate the 
environment, such as through the soil, air, or water. Many scientists believe that 
the best place to store nuclear waste is the bottom of old salt mines or inside of 
stable mountains. 


Many people are fearful that a nuclear reactor may explode like an atomic bomb. 
However, a nuclear reactor does not contain enough U-235 to do this. Also, a 
nuclear reactor is designed so that failure of any mechanism of the reactor causes 
the cadmium control rods to fall fully into the reactor, stopping the fission process. 
As evidenced by the Fukushima and Chernobyl disasters, such systems can fail. 
Systems and procedures to avoid such disasters is an important priority for 
advocates of nuclear energy. 


If all electrical power were produced by nuclear fission of U-235, Earth’s known 
reserves of uranium would be depleted in less than a century. However, Earth’s 
supply of fissionable material can be expanded considerably using a breeder 
reactor. A breeder reactor operates for the first time using the fission of U-235 as 


just described for the pressurized water reactor. But in addition to producing 
energy, some of the fast neutrons originating from the fission of U-235 are 
absorbed by U-238, resulting in the production of Pu-239 via the set of reactions 
Equation: 


1 2 2 p p 
qn 8 > BU 2ONp APU. 


The Pu-239 is itself highly fissionable and can therefore be used as a nuclear fuel 
in place of U-235. Since 99.3% of naturally occurring uranium is the U-238 
isotope, the use of breeder reactors should increase our supply of nuclear fuel by 
roughly a factor of 100. Breeder reactors are now in operation in Great Britain, 
France, and Russia. Breeder reactors also have drawbacks. First, breeder reactors 
produce plutonium, which can, if leaked into the environment, produce serious 
public health problems. Second, plutonium can be used to build bombs, thus 
increasing significantly the risk of nuclear proliferation. 


Example: 

Calculating Energy of Fissionable Fuel 

Calculate the amount of energy produced by the fission of 1.00 kg of 7°°U given 
that the average fission reaction of 7°°U produces 200 MeV. 

Strategy 

The total energy produced is the number of 7°°U atoms times the given energy per 
235UJ fission. We should therefore find the number of ?°°U atoms in 1.00 kg. 
Solution 

The number of ?°U atoms in 1.00 kg is Avogadro’s number times the number of 
moles. One mole of 7°°U has a mass of 235.04 g; thus, there are 

(1000 g)/ (235.04 g/mol) = 4.25 mol. The number of 7°°U atoms is therefore 
Equation: 


(4.25 mol) (6.02 x 107 78°U/mol) = 2.56 x 1074 ?°°U. 


Thus, the total energy released is 
Equation: 


—13 
B= (2.56 x 10725) —) i x 10-8 J 


= 13 
2351] Mev ) = 8.21 x 107° J. 


Significance 

This is another impressively large amount of energy, equivalent to about 14,000 
barrels of crude oil or 600,000 gallons of gasoline. However, it is only one-fourth 
the energy produced by the fusion of a kilogram mixture of deuterium and tritium. 
Even though each fission reaction yields about 10 times the energy of a fusion 
reaction, the energy per kilogram of fission fuel is less, because there are far 
fewer moles per kilogram of the heavy nuclides. Fission fuel is also much scarcer 
than fusion fuel, and less than 1% of uranium (the ?°°U) is readily usable. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Which has a larger energy yield per fission 
reaction, a large or small sample of pure 7?°U? 


Solution: 


the same 


Summary 


e Nuclear fission is a process in which the sum of the masses of the product 
nuclei are less than the masses of the reactants. 

e Energy changes in a nuclear fission reaction can be understood in terms of the 
binding energy per nucleon curve. 

e The production of new or different isotopes by nuclear transformation is 
called breeding, and reactors designed for this purpose are called breeder 
reactors. 


Conceptual Questions 


Exercise: 


Problem: Should an atomic bomb really be called nuclear bomb? 


Solution: 
Yes. An atomic bomb is a fission bomb, and a fission bomb occurs by 
splitting the nucleus of atom. 


Exercise: 


Problem: Why does a chain reaction occur during a fission reaction? 


Exercise: 


Problem: In what way is an atomic nucleus like a liquid drop? 


Solution: 


Short-range forces between nucleons in a nucleus are analogous to the forces 
between water molecules in a water droplet. In particular, the forces between 
nucleons at the surface of the nucleus produce a surface tension similar to that 
of a water droplet. 


Problems 


Exercise: 
Problem: 
A large power reactor that has been in operation for some months is turned 
off, but residual activity in the core still produces 150 MW of power. If the 


average energy per decay of the fission products is 1.00 MeV, what is the core 
activity? 


Exercise: 
Problem: 


(a) Calculate the energy released in this rare neutron-induced fission 
n+ 8 — %Sr + 140Xe + 3n, given m (Sr) = 95.921750 u and 


m (4°Xe) = 139.92164. 


(b) This result is about 6 MeV greater than the result for spontaneous fission. 
Why? 


(c) Confirm that the total number of nucleons and total charge are conserved 
in this reaction. 


Solution: 
a. 177.1 MeV; b. This value is approximately equal to the average BEN for 
n+ *5Ui46 —> 3eSr5s + ‘5qXes6 + 3n, 
heavy nuclei. c. A; = 239= Ag, 
4, =92=384+54= Z 
Exercise: 
Problem: 
(a) Calculate the energy released in the neutron-induced fission reaction 
n+?U — ?Kr + Ba + 2n, given m (**Kr) = 91.926269 u and 


m (142Ba) = 141.916361 u. (b) Confirm that the total number of nucleons 
and total charge are conserved in this reaction. 


Exercise: 


Problem: 


The electrical power output of a large nuclear reactor facility is 900 MW. It 
has a 35.0% efficiency in converting nuclear power to electrical power. 


(a) What is the thermal nuclear power output in megawatts? 


(b) How many 7°°U nuclei fission each second, assuming the average fission 
produces 200 MeV? 


(c) What mass of 7°°U is fissioned in 1 year of full-power operation? 
Solution: 


a.2.57 x 10° MW; b. 8.04 x 10? fissions/s; c. 991 kg 
Exercise: 


Problem: 


Find the total energy released if 1.00 kg of aU were to undergo fission. 


Glossary 


breeder reactor 
reactor that is designed to make plutonium 


criticality 
condition in which a chain reaction easily becomes self-sustaining 


critical mass 
minimum mass required of a given nuclide in order for self-sustained fission 
to occur 


fission 
splitting of a nucleus 


liquid drop model 
model of nucleus (only to understand some of its features) in which nucleons 
in a nucleus act like atoms in a drop 


Nuclear Fusion 
By the end of this section, you will be able to: 


e Describe the process of nuclear fusion in terms of its product and reactants 

¢ Calculate the energies of particles produced by a fusion reaction 

e Explain the fission concept in the context of fusion bombs, the production of 
energy by the Sun, and nucleosynthesis 


The process of combining lighter nuclei to make heavier nuclei is called nuclear 
fusion. As with fission reactions, fusion reactions are exothermic—they release 
energy. Suppose that we fuse a carbon and helium nuclei to produce oxygen: 
Equation: 


204 4He + 180 +4. 


The energy changes in this reaction can be understood using a graph of binding 
energy per nucleon ([link]). Comparing the binding energy per nucleon for oxygen, 
carbon, and helium, the oxygen nucleus is much more tightly bound than the carbon 
and helium nuclei, indicating that the reaction produces a drop in the energy of the 
system. This energy is released in the form of gamma radiation. Fusion reactions are 
said to be exothermic when the amount of energy released (known as the Q value) in 
each reaction is greater than zero (Q > 0). 


An important example of nuclear fusion in nature is the production of energy in the 
Sun. In 1938, Hans Bethe proposed that the Sun produces energy when hydrogen 
nuclei (+H) fuse into stable helium nuclei (“He) in the Sun’s core ({link]). This 
process, called the proton-proton chain, is summarized by three reactions: 


Equation: 
iH + {H > 7H+ ,Je+v4+Q, 
iH_ + 7H > 3He +7 + Q, 
3He + 3He > $He + [H+ 1H+Q. 


Thus, a stable helium nucleus is formed from the fusion of the nuclei of the hydrogen 
atom. These three reactions can be summarized by 
Equation: 


4tH > 5He+ 2 $e+2y+2v+Q. 


The net Q value is about 26 MeV. The release of this energy produces an outward 
thermal gas pressure that prevents the Sun from gravitational collapse. 
Astrophysicists find that hydrogen fusion supplies the energy stars require to 
maintain energy balance over most of a star's life span. 


The Sun produces energy by fusing hydrogen into helium at the Sun’s 
core. The red arrows show outward pressure due to thermal gas, which 
tends to make the Sun expand. The blue arrows show inward pressure due 
to gravity, which tends to make the Sun contract. These two influences 
balance each other. 


Nucleosynthesis 


Scientist now believe that many heavy elements found on Earth and throughout the 
universe were originally synthesized by fusion within the hot cores of the stars. This 
process is known as nucleosynthesis. For example, in lighter stars, hydrogen 
combines to form helium through the proton-proton chain. Once the hydrogen fuel is 
exhausted, the star enters the next stage of its life and fuses helium. An example of a 
nuclear reaction chain that can occur is: 

Equation: 


sHe + $He > ®Be+ 7, 
8Be + $He > 4C+4, 
2C + $He > 40+ 4. 


Carbon and oxygen nuclei produced in such processes eventually reach the star’s 
surface by convection. Near the end of its lifetime, the star loses its outer layers into 
space, thus enriching the interstellar medium with the nuclei of heavier elements 
({link]). 


A planetary nebula is produced at the end of the life of a star. The greenish color 
of this planetary nebula comes from oxygen ions. (credit: Hubble Heritage Team 
(STScI/AURA/NASA/ESA) ) 


Stars similar in mass to the Sun do not become hot enough to fuse nuclei as heavy (or 
heavier) than oxygen nuclei. However, in massive stars whose cores become much 
hotter (T >6 x 108 K), even more complex nuclei are produced. Some 
representative reactions are 


Equation: 
120 +20 + Na + 1H, 
2C > UC Mg + 7, 
126 + 160 > 8Si +7. 


Nucleosynthesis continues until the core is primarily iron-nickel metal. Now, iron has 
the peculiar property that any fusion or fission reaction involving the iron nucleus is 
endothermic, meaning that energy is absorbed rather than produced. Hence, nuclear 
energy cannot be generated in an iron-rich core. Lacking an outward pressure from 
fusion reactions, the star begins to contract due to gravity. This process heats the core 
to a temperature on the order of 5 x 10°K. Expanding shock waves generated 
within the star due to the collapse cause the star to quickly explode. The luminosity 
of the star can increase temporarily to nearly that of an entire galaxy. During this 
event, the flood of energetic neutrons reacts with iron and the other nuclei to produce 
elements heavier than iron. These elements, along with much of the star, are ejected 
into space by the explosion. Supernovae and the formation of planetary nebulas 
together play a major role in the dispersal of chemical elements into space. 


Eventually, much of the material lost by stars is pulled together through the 
gravitational force, and it condenses into a new generation of stars and accompanying 
planets. Recent images from the Hubble Space Telescope provide a glimpse of this 
magnificent process taking place in the constellation Serpens ((link]). The new 
generation of stars begins the nucleosynthesis process anew, with a higher percentage 
of heavier elements. Thus, stars are “factories” for the chemical elements, and many 
of the atoms in our bodies were once a part of stars. 


This image taken by NASA’s Spitzer Space Telescope 
and the Two Micron All Sky Survey (2MASS), shows 
the Serpens Cloud Core, a star-forming region in the 
constellation Serpens (the “Serpent”). Located about 
750 light-vears awav. this cluster of stars is farmed 
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from cooling dust and gases. Infrared light has been 
used to reveal the youngest stars in orange and yellow. 
(credit: NASA/JPL-Caltech/2MASS) 


Example: 

Energy of the Sun 

The power output of the Sun is approximately 3.8 x 107° J/s. Most of this energy 
is produced in the Sun’s core by the proton-proton chain. This energy is transmitted 
outward by the processes of convection and radiation. (a) How many of these fusion 
reactions per second must occur to supply the power radiated by the Sun? (b) What 
is the rate at which the mass of the Sun decreases? (c) In about five billion years, the 
central core of the Sun will be depleted of hydrogen. By what percentage will the 
mass of the Sun have decreased from its present value when the core is depleted of 
hydrogen? 

Strategy 

The total energy output per second is given in the problem statement. If we know the 
energy released in each fusion reaction, we can determine the rate of the fusion 
reactions. If the mass loss per fusion reaction is known, the mass loss rate is known. 
Multiplying this rate by five billion years gives the total mass lost by the Sun. This 
value is divided by the original mass of the Sun to determine the percentage of the 
Sun’s mass that has been lost when the hydrogen fuel is depleted. 

Solution 


a. The decrease in mass for the fusion reaction is 


Equation: 
Am = 4m ({H) —m (jHe) — 2m (Se) 
= 4(1.007825 u) — 4.002603 u — 2 (0.000549 u) 
a2 reste 


The energy released per fusion reaction is 
Equation: 


Q = (0.0276 u) (931.49 MeV/u) = 25.7 MeV. 


Thus, to supply 3.8 x 107° J/s = 2.38 x 10°° MeV/s, there must be 
Equation: 


2.38 x 10°° MeV/s 


—~ — 9.26 x 10°” reaction/s. 
25.7 MeV /reaction 


b. The Sun’s mass decreases by 0.0276 u = 4.58 x 10~7°kg per fusion reaction, 
so the rate at which its mass decreases is 
Equation: 


(9.26 x Hees reaction/s) (4.58 x 10°” kg/reaction) = 4.24 x 10°kg/s. 


In5 x 10°y=1.6 x 10!"s, the Sun’s mass will therefore decrease by 
Equation: 


2 


AVE (4 AA ce eG es) 6:8 ole ices 


The current mass of the Sun is about 2.0 x 10°” kg, so the percentage 
decrease in its mass when its hydrogen fuel is depleted will be 
Equation: 
G.85< 10; ak 
Gaerne x 100% = 0.034%. 
2.0 x 10° kg 


Significance 

After five billion years, the Sun is very nearly the same mass as it is now. Hydrogen 
burning does very little to change the mass of the Sun. This calculation assumes that 
only the proton-proton decay change is responsible for the power output of the Sun. 


Note: 
Exercise: 


Problem: 
Check Your Understanding Where does the energy from the Sun originate? 


Solution: 


the conversion of mass to energy 


The Hydrogen Bomb 


In 1942, Robert Oppenheimer suggested that the extremely high temperature of an 
atomic bomb could be used to trigger a fusion reaction between deuterium and 
tritium, thus producing a fusion (or hydrogen) bomb. The reaction between deuterium 
and tritium, both isotopes of hydrogen, is given by 

Equation: 


+H + 3H — $He + gn + 17.6 MeV. 


Deuterium is relatively abundant in ocean water but tritium is scarce. However, 
tritium can be generated in a nuclear reactor through a reaction involving lithium. 
The neutrons from the reactor cause the reaction 

Equation: 


pn + SLi + $He + 3H + pn, 


to produce the desired tritium. The first hydrogen bomb was detonated in 1952 on the 
remote island of Eniwetok in the Marshall Islands. A hydrogen bomb has never been 
used in war. Modern hydrogen bombs are approximately 1000 times more powerful 
than the fission bombs dropped on Hiroshima and Nagasaki in World War II. 


The Fusion Reactor 


The fusion chain believed to be the most practical for use in a nuclear fusion reactor 
is the following two-step process: 
Equation: 


2H + 2H — 3H + 1H, 
2H + 3H > 4He + In. 


This chain, like the proton-proton chain, produces energy without any radioactive by- 
product. However, there is a very difficult problem that must be overcome before 
fusion can be used to produce significant amounts of energy: Extremely high 


temperatures (~10’K) are needed to drive the fusion process. To meet this challenge, 
test fusion reactors are being developed to withstand temperatures 20 times greater 
than the Sun’s core temperature. An example is the Joint European Torus (JET) 
shown in [link]. A great deal of work still has to be done on fusion reactor 
technology, but many scientists predict that fusion energy will power the world’s 
cities by the end of the twentieth century. 
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The Joint European Torus (JET) tokamak fusion detector uses magnetic fields to 
fuse deuterium and tritium nuclei (credit: EUROfusion). 


Summary 


¢ Nuclear fusion is a reaction in which two nuclei are combined to form a larger 
nucleus; energy is released when light nuclei are fused to form medium-mass 
nuclei. 

e The amount of energy released by a fusion reaction is known as the Q value. 


e Nuclear fusion explains the reaction between deuterium and tritium that 
produces a fusion (or hydrogen) bomb; fusion also explains the production of 
energy in the Sun, the process of nucleosynthesis, and the creation of the heavy 
elements. 


Conceptual Questions 
Exercise: 
Problem: Explain the difference between nuclear fission and nuclear fusion. 


Exercise: 


Problem: 

Why does the fusion of light nuclei into heavier nuclei release energy? 
Solution: 

The nuclei produced in the fusion process have a larger binding energy per 
nucleon than the nuclei that are fused. That is, nuclear fusion decreases average 
energy of the nucleons in the system. The energy difference is carried away as 
radiation. 


Problems 


Exercise: 


Problem: 


Verify that the total number of nucleons, and total charge are conserved for each 
of the following fusion reactions in the proton-proton chain. 


(i) 'H+ 1H > 7H+et+%, 
(ii) 'H+ 7H > *He + 4, and (iii) *He + *He > *He + ‘H+ !H. 


(List the value of each of the conserved quantities before and after each of the 
reactions.) 


Solution: 


iH +}H > ?7H+ et +, 
i. A; =14+1=2;A;=2 Z,=14+1=2; 
Ze Silt l=2 
1H+7H + 3H+y¥ 
Hi, Ay SHlH2 =] 3A; =340=3:47=1+-b=2; 
Ze = 1b 1=2 
SH+2H — $H+i1H+iH 
Zh =2+1+1=4 
Exercise: 


Problem: 


Calculate the energy output in each of the fusion reactions in the proton-proton 
chain, and verify the values determined in the preceding problem. 


Exercise: 
Problem: 


Show that the total energy released in the proton-proton chain is 26.7 MeV, 
considering the overall effect in ‘'H + 4H > 7H + et + v%, 


1H + 7H — ?He +7, and °He + *He — *He + 1H + !H. Be sure to include 
the annihilation energy. 


Solution: 


26.73 MeV 
Exercise: 


Problem: 


Two fusion reactions mentioned in the text aren + ?He + “He + 7 and 
n+ +H — 7H + 4. Both reactions release energy, but the second also creates 
more fuel. Confirm that the energies produced in the reactions are 20.58 and 
2.22 MeV, respectively. Comment on which product nuclide is most tightly 
bound, “He or 2H. 


Exercise: 


Problem: 


The power output of the Sun is 4 x 107°W. (a) If 90% of this energy is 
supplied by the proton-proton chain, how many protons are consumed per 
second? (b) How many neutrinos per second should there be per square meter at 
the surface of Earth from this process? 


Solution: 


a33c-10" protons/s;b.6 x 10. neutrinos /m? -s 
This huge number is indicative of how rarely a neutrino interacts, since large 
detectors observe very few per day. 


Exercise: 


Problem: 


Another set of reactions that fuses hydrogen into helium in the Sun and 
especially in hotter stars is called the CNO cycle: 


PC+1H > BN+y 
SN > 8C+et +u¢ 
BC+/H3= MN+y7 
MN +1H > FO+y7 
oO = PN er ve 
MN +1H > MC + *He 


This process is a “cycle” because !“C appears at the beginning and end of these 
reactions. Write down the overall effect of this cycle (as done for the proton- 
proton chain in 2e~ + 4'H —> He + 2v, + 6y). Assume that the positrons 
annihilate electrons to form more ¥ rays. 


Exercise: 
Problem: 
(a) Calculate the energy released by the fusion of a 1.00-kg mixture of 


deuterium and tritium, which produces helium. There are equal numbers of 
deuterium and tritium nuclei in the mixture. 


(b) If this process takes place continuously over a period of a year, what is the 
average power output? 


Solution: 


a. The atomic mass of deuterium (7H) is 2.014102 u, while that of tritium (3H) 
is 3.016049 u, for a total of 5.032151 u per reaction. So a mole of reactants has a 
mass of 5.03 g, and in 1.00 kg, there are (1000 g) / (5.03 g/mol) = 198.8 mol 
of reactants. The number of reactions that take place is therefore 

(198.8 mol) (6.02 x 1073 mol™*) = 1.20 x 10° reactions. 

The total energy output is the number of reactions times the energy per reaction: 
BS 330 10s 

b. Power is energy per unit time. One year has 3.16 x 10’s, so 
P=10.7MW. 

We expect nuclear processes to yield large amounts of energy, and this is 
certainly the case here. The energy output of 3.37 x 104 J from fusing 1.00 kg 
of deuterium and tritium is equivalent to 2.6 million gallons of gasoline and 
about eight times the energy output of the bomb that destroyed Hiroshima. Yet 
the average backyard swimming pool has about 6 kg of deuterium in it, so that 
fuel is plentiful if it can be utilized in a controlled manner. 


Glossary 


nuclear fusion 


process of combining lighter nuclei to make heavier nuclei 


nuclear fusion reactor 


nuclear reactor that uses the fusion chain to produce energy 


nucleosynthesis 


process of fusion by which all elements on Earth are believed to have been 
created 


proton-proton chain 


combined reactions that fuse hydrogen nuclei to produce He nuclei 


Medical Applications and Biological Effects of Nuclear Radiation 
By the end of this section, you will be able to: 


e Describe two medical uses of nuclear technology 

e Explain the origin of biological effects due to nuclear radiation 

e List common sources of radiation and their effects 

e Estimate exposure for nuclear radiation using common dosage units 


Nuclear physics is an integral part of our everyday lives ([link]). Radioactive compounds are 
used to identify cancer, study ancient artifacts, and power our cities. Nuclear fusion also 
powers the Sun, the primary source of energy on Earth. The focus of this chapter is nuclear 
radiation. In this section, we ask such questions as: How is nuclear radiation used to benefit 
society? What are its health risks? How much nuclear radiation is the average person 
exposed to in a lifetime? 


Dr. Tori Randall, a curator at the San Diego Museum of 
Man, uses nuclear radiation to study a 500-year-old 
Peruvian child mummy. The origin of this radiation is 
the transformation of one nucleus to another. (credit: 
Samantha A. Lewis, U.S. Navy) 


Medical Applications 


Medical use of nuclear radiation is quite common in today’s hospitals and clinics. One of the 
most important uses of nuclear radiation is the location and study of diseased tissue. This 


application requires a special drug called a radiopharmaceutical. A radiopharmaceutical 
contains an unstable radioactive isotope. When the drug enters the body, it tends to 
concentrate in inflamed regions of the body. (Recall that the interaction of the drug with the 
body does not depend on whether a given nucleus is replaced by one of its isotopes, since 
this interaction is determined by chemical interactions.) Radiation detectors used outside the 
body use nuclear radiation from the radioisotopes to locate the diseased tissue. 
Radiopharmaceuticals are called radioactive tags because they allow doctors to track the 
movement of drugs in the body. Radioactive tags are for many purposes, including the 
identification of cancer cells in the bones, brain tumors, and Alzheimer’s disease ([Link]). 
Radioactive tags are also used to monitor the function of body organs, such as blood flow, 
heart muscle activity, and iodine uptake in the thyroid gland. 


These brain images are produced using a radiopharmaceutical. The colors indicate 
relative metabolic or biochemical activity (red indicates high activity and blue indicates 
low activity). The figure on the left shows the normal brain of an individual and the 
figure on the right shows the brain of someone diagnosed with Alzheimer’s disease. The 
brain image of the normal brain indicates much greater metabolic activity (a larger 
fraction of red and orange areas). (credit: modification of works by National Institutes 
of Health) 


[link] lists some medical diagnostic uses of radiopharmaceuticals, including isotopes and 
typical activity (A) levels. One common diagnostic test uses iodine to image the thyroid, 
since iodine is concentrated in that organ. Another common nuclear diagnostic is the thallium 
scan for the cardiovascular system, which reveals blockages in the coronary arteries and 


examines heart activity. The salt TIC] can be used because it acts like NaCl and follows the 
blood. Note that [link] lists many diagnostic uses for 9°™Tc, where “m” stands for a 

metastable state of the technetium nucleus. This isotope is used in many compounds to image 
the skeleton, heart, lungs, and kidneys. About 80% of all radiopharmaceuticals employ 
°°m Tc because it produces a single, easily identified, 0.142-MeV ¥ ray and has a short 6.0-h 
half-life, which reduces radiation exposure. 


Procedure, 
Isotope 


Brain scan 
99m Te 
15Q (PET) 


Lung scan 


133X@ 


99m'T 


Activity (mCi), where 
1mCi = 3.7 x 10’Bq 


VES 


50 


Vas 


Cardiovascular blood pool 


131] 


99m 'T 


0.2 


Cardiovascular arterial flow 


2017] 


24Na 


3 


7.5 


Diagnostic Uses of Radiopharmaceuticals 


Procedure, 
Isotope 


Thyroid scan 


131] 


1237 


Liver scan 


198 Ay 
(colloid) 


99m'T 
(colloid) 


Bone scan 
85Sr 

99m'T 
Kidney scan 
THe 


99m'T 


Activity (mCi), where 
1mCi = 3.7 x 10’Bq 


0.05 


0.07 


0.1 


0.1 


10 


0.1 


1.5 


The first radiation detectors produced two-dimensional images, like a photo taken from a 
camera. However, a circular array of detectors that can be rotated can be used to produce 
three-dimensional images. This technique is similar to that used in X-ray computed 


tomography (CT) scans. One application of this technique is called single-photon-emission 
CT (SPECT) ([link]). The spatial resolution of this technique is about 1 cm. 


The SPECT machine uses radiopharmaceutical compounds to produce an image of 
the human body. The machine takes advantage of the physics of nuclear beat decays 
and electron-positron collisions. (credit: “Woldo”/Wikimedia Commons) 


Improved image resolution is achieved by a technique known as positron emission 
tomography (PET). This technique use radioisotopes that decay by @* radiation. When a 
positron encounters an electron, these particle annihilate to produce two gamma-ray photons. 
This reaction is represented by 

Equation: 


e'te —2y. 


These y-ray photons have identical 0.511-MeV energies and move directly away from one 
another ([link]). This easily identified decay signature can be used to identify the location of 
the radioactive isotope. Examples of 8*-emitting isotopes used in PET include 

11¢, 3N, 0, and !8F. The nuclei have the advantage of being able to function as tags for 
natural body compounds. Its resolution of 0.5 cm is better than that of SPECT. 


e* +e 
annihilation 


A PET system takes advantage of the two identical y-ray photons 
produced by positron-electron annihilation. These y rays are 
emitted in opposite directions, so that the line along which each 
pair is emitted is determined. 


PET scans are especially useful to examine the brain’s anatomy and function. For example, 
PET scans can be used to monitor the brain’s use of oxygen and water, identify regions of 
decreased metabolism (linked to Alzheimer’s disease), and locate different parts of the brain 
responsible for sight, speech, and fine motor activity 


Note: 
Is it a tumor? View an animation of simplified magnetic resonance imaging (MRI) to see if 
you can tell. Your head is full of tiny radio transmitters (the nuclear spins of the hydrogen 


nuclei of your water molecules). In an MRI unit, these little radios can be made to broadcast 
their positions, giving a detailed picture of the inside of your head. 


Biological Effects 


Nuclear radiation can have both positive and negative effects on biological systems. 
However, it can also be used to treat and even cure cancer. How do we understand these 
effects? To answer this question, consider molecules within cells, particularly DNA 
molecules. 


Cells have long, double-helical DNA molecules containing chemical codes that govern the 
function and processes of the cell. Nuclear radiation can alter the structural features of the 
DNA chain, leading to changes in the genetic code. In human cells, we can have as many as 
a million individual instances of damage to DNA per cell per day. DNA contains codes that 
check whether the DNA is damaged and can repair itself. This repair ability of DNA is vital 
for maintaining the integrity of the genetic code and for the normal functioning of the entire 
organism. It should be constantly active and needs to respond rapidly. The rate of DNA 
repair depends on various factors such as the type and age of the cell. If nuclear radiation 
damages the ability of the cell to repair DNA, the cell can 


1. Retreat to an irreversible state of dormancy (known as senescence); 
2. Commit suicide (known as programmed cell death); or 
3. Progress into unregulated cell division, possibly leading to tumors and cancers. 


Nuclear radiation can harm the human body is many other ways as well. For example, high 
doses of nuclear radiation can cause burns and even hair loss. 


Biological effects of nuclear radiation are expressed by many different physical quantities 
and in many different units. A common unit to express the biological effects of nuclear 
radiation is the rad or radiation dose unit. One rad is equal to 1/100 of a joule of nuclear 
energy deposited per kilogram of tissue, written: 

Equation: 


lrad = 0.01 J/kg. 


For example, if a 50.0-kg person is exposed to nuclear radiation over her entire body and she 
absorbs 1.00 J, then her whole-body radiation dose is 
Equation: 


(1.00 J) / (50.0 kg) = 0.0200 J/kg = 2.00 rad. 


Nuclear radiation damages cells by ionizing atoms in the cells as they pass through the cells 
({link]). The effects of ionizing radiation depend on the dose in rads, but also on the type of 


radiation (alpha, beta, gamma, or X-ray) and the type of tissue. For example, if the range of 
the radiation is small, as it is for a@ rays, then the ionization and the damage created is more 
concentrated and harder for the organism to repair. To account for such affects, we define the 
relative biological effectiveness (RBE). Sample RBE values for several types of ionizing 
nuclear radiation are given in [link]. 


Low 
ionization 
density 


High 
ionization 
density 


The image shows ionization created in cells by a and y radiation. Because of its shorter 
range, the ionization and damage created by a rays is more concentrated and harder for 
the organism to repair. Thus, the RBE for @ rays is greater than the RBE for y rays, 
even though they create the same amount of ionization at the same energy. 


Type and Energy of Radiation RBEU! 
X-rays 1 

7 rays 1 

8 rays greater than 32 keV il 

8 rays less than 32 keV i,7 
Neutrons, thermal to slow (<20 keV) 2-5 


Neutrons, fast (1-10 MeV) 10 (body), 32 (eyes) 


Type and Energy of Radiation RBE!! 


Protons (1-10 MeV) 10 (body), 32 (eyes) 
a rays from radioactive decay 10-20 
Heavy ions from accelerators 10-20 


Relative Biological Effectiveness[1] Values approximate. Difficult to determine. 


A dose unit more closely related to effects in biological tissue is called the roentgen 
equivalent man (rem) and is defined to be the dose (in rads) multiplied by the relative 
biological effectiveness (RBE). Thus, if a person had a whole-body dose of 2.00 rad of 
radiation, the dose in rem would be (2.00 rad) (1) = 2.00 rem for the whole body. If the 
person had a whole-body dose of 2.00 rad of @ radiation, then the dose in rem would be 
(2.00 rad) (20) = 40.0 rem for the whole body. The a rays would have 20 times the effect 
on the person than the y rays for the same deposited energy. The SI equivalent of the rem, 
and the more standard term, is the sievert (Sv) is 

Equation: 


1Sv = 100 rem. 


The RBEs given in [link] are approximate but reflect an understanding of nuclear radiation 
and its interaction with living tissue. For example, neutrons are known to cause more damage 
than ¥ rays, although both are neutral and have large ranges, due to secondary radiation. Any 
dose less than 100 mSv (10 rem) is called a low dose, 0.1 Sv to 1 Sv (10 to 100 rem) is 
called a moderate dose, and anything greater than 1 Sv (100 rem) is called a high dose. It is 
difficult to determine if a person has been exposed to less than 10 mSv. 


Biological effects of different levels of nuclear radiation on the human body are given in 
[link]. The first clue that a person has been exposed to radiation is a change in blood count, 
which is not surprising since blood cells are the most rapidly reproducing cells in the body. 
At higher doses, nausea and hair loss are observed, which may be due to interference with 
cell reproduction. Cells in the lining of the digestive system also rapidly reproduce, and their 
destruction causes nausea. When the growth of hair cells slows, the hair follicles become thin 
and break off. High doses cause significant cell death in all systems, but the lowest doses that 
cause fatalities do so by weakening the immune system through the loss of white blood cells. 


2-5 


4.5 


5-20 


>20 


Effect 

No observable effect. 

Slight to moderate decrease in white blood cell counts. 
Temporary sterility; 0.35 for women, 0.50 for men. 


Significant reduction in blood cell counts, brief nausea and vomiting. 
Rarely fatal. 


Nausea, vomiting, hair loss, severe blood damage, hemorrhage, 
fatalities. 


Lethal to 50% of the population within 32 days after exposure if not 
treated. 


Worst effects due to malfunction of small intestine and blood systems. 
Limited survival. 


Fatal within hours due to collapse of central nervous system. 


Immediate Effects of Radiation (Adults, Whole-Body, Single Exposure){1] Multiply by 100 
to obtain dose in rem. 


Sources of Radiation 


Human are also exposed to many sources of nuclear radiation. A summary of average 
radiation doses for different sources by country is given in [link]. Earth emits radiation due to 
the isotopes of uranium, thorium, and potassium. Radiation levels from these sources depend 
on location and can vary by a factor of 10. Fertilizers contain isotopes of potassium and 
uranium, which we digest in the food we eat. Fertilizers have more than 3000 Bq/kg 
radioactivity, compared to just 66 Bq/kg for Carbon-14. 


Source 


Dose (mSv/y)" 


Australia Germany US World 


Source 

Natural radiation — external 
Cosmic rays 

Soil, building materials 
Radon gas 

Natural radiation — internal 
40K, MC 226Ra 

Artificial radiation 
Medical and dental 


TOTAL 


Dose (mSv/y)!!! 


0.30 


0.40 


0.90 


0.24 


0.80 


2.6 


0.28 


0.40 


1.1 


0.28 


0.90 


3.0 


0.30 


0.30 


2.0 


0.40 


0.53 


3.5 


0.39 


0.48 


1.2 


0.29 


0.40 


2.8 


Background Radiation Sources and Average Doses[1] Multiply by 100 to obtain does in 


mrem/y. 


Medical visits are also a source of nuclear radiation. A sample of common nuclear radiation 


doses is given in [link]. These doses are generally low and can be lowered further with 
improved techniques and more sensitive detectors. With the possible exception of routine 
dental X-rays, medical use of nuclear radiation is used only when the risk-benefit is 
favorable. Chest X-rays give the lowest doses—about 0.1 mSv to the tissue affected, with 
less than 5% scattering into tissues that are not directly imaged. Other X-ray procedures 

range upward to about 10 mSv in a CT scan, and about 5 mSv (0.5 rem) per dental X-ray, 


again both only affecting the tissue imaged. Medical images with radiopharmaceuticals give 


doses ranging from 1 to 5 mSv, usually localized. 


Procedure 


Chest 


Dental 


Skull 


Effective Dose (mSv) 


0.02 


0.01 


0.07 


Procedure Effective Dose (mSv) 


Leg 0.02 
Mammogram 0.40 
Barium enema 7.0 
Upper GI 3.0 
CT head 2.0 
CT abdomen 10.0 


Typical Doses Received During Diagnostic X-Ray Exams 


Example: 

What Mass of !°’Cs Escaped Chernobyl? 

The Chernobyl accident in Ukraine (formerly in the Soviet Union) exposed the surrounding 
population to a large amount of radiation through the decay of 1°’Cs. The initial 
radioactivity level was approximately A = 6.0 MCi. Calculate the total mass of !°’Cs 
involved in this accident. 

Strategy 

The total number of nuclei, N, can be determined from the known half-life and activity of 
137s (30.2 y). The mass can be calculated from N using the concept of a mole. 

Solution 

Solving the equation A = aaee for N gives 


Equation: 


Aty/2 


0.693 


Entering the given values yields 
Equation: 


(6.0 MCi) (30.2 y) 
0.693 


To convert from curies to becquerels and years to seconds, we write 
Equation: 


(6.0 x 10° Ci) (3.7 x 10°° Bq/Ci) (30.2 y) (3.16 x 10’ s/y) 
0.693 


N= — 3.1 x 107. 


One mole of a nuclide “X has a mass of A grams, so that one mole of !°’Cs has a mass of 
137 g. Amole has 6.02 x 107° nuclei. Thus the mass of !°’Cs released was 
Equation: 


137g 26 3 
m= | ————— ]} (3.1 x 10 = 70 x 10°¢g = 70 kg. 
= x | ( ) : : 


Significance 

The mass of !°’Cs involved in the Chernoby] accident is a small material compared to the 
typical amount of fuel used in a nuclear reactor. However, approximately 250 people were 
admitted to local hospitals immediately after the accident, and diagnosed as suffering acute 
radiation syndrome. They received external radiation dosages between 1 and 16 Sv. 
Referring to biological effects in [link], these dosages are extremely hazardous. The 
eventual death toll is estimated to be around 4000 people, primarily due to radiation-induced 
cancer. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Radiation propagates in all directions from its source, 
much as electromagnetic radiation from a light bulb. Is activity concept more analogous 
to power, intensity, or brightness? 


Solution: 


power 


Summary 


¢ Nuclear technology is used in medicine to locate and study diseased tissue using special 
drugs called radiopharmaceuticals. Radioactive tags are used to identify cancer cells in 
the bones, brain tumors, and Alzheimer’s disease, and to monitor the function of body 
organs, such as blood flow, heart muscle activity, and iodine uptake in the thyroid gland. 

e The biological effects of ionizing radiation are due to two effects it has on cells: 
interference with cell reproduction and destruction of cell function. 

¢ Common sources of radiation include that emitted by Earth due to the isotopes of 
uranium, thorium, and potassium; natural radiation from cosmic rays, soils, and building 
materials, and artificial sources from medical and dental diagnostic tests. 


¢ Biological effects of nuclear radiation are expressed by many different physical 
quantities and in many different units, including the rad or radiation dose unit. 


Key Equations 


Atomic mass number 
Standard format for expressing an isotope 


Nuclear radius, where rg is the radius of a 
single proton 


Mass defect 

Binding energy 

Binding energy per nucleon 
Radioactive decay rate 
Radioactive decay law 


Decay constant 


Lifetime of a substance 


Activity of a radioactive substance 


Activity of a radioactive substance (linear 
form) 


Alpha decay 
Beta decay 
Positron emission 


Gamma decay 


A=Z4IN 
ax 
r=r Al 


E = (Am)e? 
BEN = 
—_" = )\N 
N = Noe 
= "es 
r=} 

A= Ape 


In A = —At+1n Ao 
Ax — 45% + $He 
4X > 7, AX+ fe+v 
4X > , AX4+ Se+v 


ax*— AX +7 


Conceptual Questions 


Exercise: 


Problem: Why is a PET scan more accurate than a SPECT scan? 
Exercise: 
Problem: 


Isotopes that emit a radiation are relatively safe outside the body and exceptionally 
hazardous inside. Explain why. 


Solution: 


Alpha particles do not penetrate materials such as skin and clothes easily. (Recall that 
alpha radiation is barely able to pass through a thin sheet of paper.) However, when 
produce inside the body, neighboring cells are vulnerable. 


Exercise: 
Problem: 


Ionizing radiation can impair the ability of a cell to repair DNA. What are the three 
ways the cell can respond? 


Problems 


Exercise: 
Problem: 
What is the dose in mSv for: (a) a 0.1-Gy X-ray? (b) 2.5 mGy of neutron exposure to 
the eye? (c) 1.5m Gy of a exposure? 
Exercise: 
Problem: 
Find the radiation dose in Gy for: (a) A 10-mSv fluoroscopic X-ray series. (b) 50 mSv 


of skin exposure by an a emitter. (c) 160 mSv of @~ and ¥ rays from the “°K in your 
body. 


Solution: 


Gy = ee a. 0.01 Gy; b. 0.0025 Gy; c. 0.16 Gy 


Exercise: 


Problem: Find the mass of 2°?Pu that has an activity of 1.00 pCi. 
Exercise: 
Problem: 
In the 1980s, the term picowave was used to describe food irradiation in order to 


overcome public resistance by playing on the well-known safety of microwave 
radiation. Find the energy in MeV of a photon having a wavelength of a picometer. 


Solution: 


1.24 MeV 
Exercise: 
Problem: 
What is the dose in Sv in a cancer treatment that exposes the patient to 200 Gy of y 
rays? 
Exercise: 
Problem: 
One half the + rays from 9°™Tc are absorbed by a 0.170-mm-thick lead shielding. Half 


of the y rays that pass through the first layer of lead are absorbed in a second layer of 
equal thickness. What thickness of lead will absorb all but one in 1000 of these y rays? 


Solution: 


1.69 mm 
Exercise: 
Problem: 
How many Gy of exposure is needed to give a cancerous tumor a dose of 40 Sv if it is 
exposed to @ activity? 
Exercise: 
Problem: 
A plumber at a nuclear power plant receives a whole-body dose of 30 mSv in 15 


minutes while repairing a crucial valve. Find the radiation-induced yearly risk of death 
from cancer and the chance of genetic defect from this maximum allowable exposure. 


Solution: 


For cancer: (3 rem) ( 10 ) — 0 The risk each year of dying from induced 


10°rem-y 10°y ’ 
cancer is 30 in a million. For genetic defect: (3 rem) ( 23 ) — 2 | The chance 
10°rem-y 10°y 
each year of an induced genetic defect is 10 in a million. 
Exercise: 
Problem: 


Calculate the dose in rem/y for the lungs of a weapons plant employee who inhales and 
retains an activity of 1.00Ci 72°Pu in an accident. The mass of affected lung tissue is 
2.00 kg and the plutonium decays by emission of a 5.23-MeV a particle. Assume a 
RBE value of 20. 


Additional Problems 


Exercise: 
Problem: 
The wiki-phony site states that the atomic mass of chlorine is 40 g/mol. Check this 


result. Hint: The two, most common stable isotopes of chlorine are: 3Cl and tC, (The 
abundance of Cl-35 is 75.8%, and the abundance of C]-37 is 24.2%.) 


Solution: 


atomic mass (Cl) = 35.5 g/mol 
Exercise: 
Problem: 


A particle physicist discovers a neutral particle with a mass of 2.02733 u that he 
assumes is two neutrons bound together. 


(a) Find the binding energy. 


(b) What is unreasonable about this result? 
Exercise: 
Problem: 
A nuclear physicist finds 1.0 yg of 7°°U in a piece of uranium ore (Ty/7 = 
2.348 X 10’y). (a) Use the decay law to determine how much 7°°U would had to have 


been on Earth when it formed 4.543 x 10°y ago for 1.0 ug to be left today. (b) What 
is unreasonable about this result? (c) How is this unreasonable result resolved? 


Solution: 


a. 1.71 x 10°%kg; b. This mass is impossibly large; it is greater than the mass of the 
entire Milky Way galaxy. c. °°U is not produced through natural processes operating 
over long times on Earth, but through artificial processes in a nuclear reactor. 


Exercise: 
Problem: 
A group of scientists use carbon dating to date a piece of wood to be 3 billion years old. 
Why doesn’t this make sense? 
Exercise: 
Problem: 
According to your lab partner, a 2.00-cm-thick sodium-iodide crystal absorbs all but 


10% of rays from a radioactive source and a 4.00-cm piece of the same material absorbs 
all but 5%? Is this result reasonable? 


Solution: 


If 10% of rays are left after 2.00 cm, then only (0.100)? = 0.01 = 1% are left after 
4.00 cm. This is much smaller than your lab partner’s result (5%). 


Exercise: 
Problem: 
In the science section of the newspaper, an article reports the efforts of a group of 


scientists to create a new nuclear reactor based on the fission of iron (Fe). Is this a good 
idea? 


Exercise: 


Problem: 


The ceramic glaze on a red-orange “Fiestaware” plate is U2Ox3 and contains 50.0 grams 
of 23°U, but very little 2°°U. (a) What is the activity of the plate? (b) Calculate the total 
energy that will be released by the 7°°U decay. (c) If energy is worth 12.0 cents per 
kW - h, what is the monetary value of the energy emitted? (These brightly-colored 
ceramic plates went out of production some 30 years ago, but are still available as 
collectibles.) 


Solution: 


a. 1.68 x 10° Ci; (b) From Appendix B, the energy released per decay is 4.27 MeV, 
so 8.65 x 101° J; (c) The monetary value of the energy is $2.9 x 10° 


Exercise: 


Problem: 


Large amounts of depleted uranium (720) are available as a by-product of uranium 
processing for reactor fuel and weapons. Uranium is very dense and makes good 
counter weights for aircraft. Suppose you have a 4000-kg block of ?3°U. (a) Find its 
activity. (b) How many calories per day are generated by thermalization of the decay 
energy? (c) Do you think you could detect this as heat? Explain. 


Exercise: 
Problem: 
A piece of wood from an ancient Egyptian tomb is tested for its carbon-14 activity. It is 


found to have an activity per gram of carbon of A = 10 decay /min - g. What is the age 
of the wood? 


Solution: 


We know that A = 3.84 x 10-!2s~! and 
Ao = 0.25 decays/s - g = 15 decays/min - g. 
Thus, the age of the tomb is 


—__ 1 id, 10 decays/min-g __ nigeee 
i= Pale In ices ae 1.06 x 10° s + 3350 y. 


Challenge Problems 


Exercise: 


Problem: 


This problem demonstrates that the binding energy of the electron in the ground state of 
a hydrogen atom is much smaller than the rest mass energies of the proton and electron. 


(a) Calculate the mass equivalent in u of the 13.6-eV binding energy of an electron in a 
hydrogen atom, and compare this with the known mass of the hydrogen atom. 


(b) Subtract the known mass of the proton from the known mass of the hydrogen atom. 


(c) Take the ratio of the binding energy of the electron (13.6 eV) to the energy 
equivalent of the electron’s mass (0.511 MeV). 


(d) Discuss how your answers confirm the stated purpose of this problem. 


Exercise: 


Problem: 


The Galileo space probe was launched on its long journey past Venus and Earth in 
1989, with an ultimate goal of Jupiter. Its power source is 11.0 kg of ?°Pu, a by- 
product of nuclear weapons plutonium production. Electrical energy is generated 
thermoelectrically from the heat produced when the 5.59-MeV a particles emitted in 
each decay crash to a halt inside the plutonium and its shielding. The half-life of 77°Pu 
is 87.7 years. 


(a) What was the original activity of the 2°*Pu in becquerels? 
(b) What power was emitted in kilowatts? 


(c) What power was emitted 12.0 y after launch? You may neglect any extra energy 
from daughter nuclides and any losses from escaping ¥ rays. 


Solution: 


a. 6.97 x 10!° Bq; b. 6.24 kW; c. 5.67 kw 


Exercise: 


Problem: Find the energy emitted in the 8~ decay of °°Co. 
Exercise: 


Problem: 


Engineers are frequently called on to inspect and, if necessary, repair equipment in 
nuclear power plants. Suppose that the city lights go out. After inspecting the nuclear 
reactor, you find a leaky pipe that leads from the steam generator to turbine chamber. (a) 
How do the pressure readings for the turbine chamber and steam condenser compare? 
(b) Why is the nuclear reactor not generating electricity? 


Solution: 


a. Due to the leak, the pressure in the turbine chamber has dropped significantly. The 
pressure difference between the turbine chamber and steam condenser is now very low. 
b. A large pressure difference is required for steam to pass through the turbine chamber 
and turn the turbine. 


Exercise: 


Problem: 


If two nuclei are to fuse in a nuclear reaction, they must be moving fast enough so that 
the repulsive Coulomb force between them does not prevent them for getting within 
R= 10~“m of one another. At this distance or nearer, the attractive nuclear force can 
overcome the Coulomb force, and the nuclei are able to fuse. 


(a) Find a simple formula that can be used to estimate the minimum kinetic energy the 
nuclei must have if they are to fuse. To keep the calculation simple, assume the two 
nuclei are identical and moving toward one another with the same speed v. (b) Use this 
minimum kinetic energy to estimate the minimum temperature a gas of the nuclei must 
have before a significant number of them will undergo fusion. Calculate this minimum 
temperature first for hydrogen and then for helium. (Hint: For fusion to occur, the 
minimum kinetic energy when the nuclei are far apart must be equal to the Coulomb 
potential energy when they are a distance R apart.) 


Exercise: 
Problem: 
For the reaction, n + *He — “He + 4, find the amount of energy transfers to “He and 


7 (on the right side of the equation). Assume the reactants are initially at rest. (Hint: 
Use conservation of momentum principle.) 


Solution: 
The energies are 
E, = 20.6 MeV 


. Notice that most of the ener oes to the ¥ ray. 
Evge = 5.68 x 10-2 MeV oe oe 


Exercise: 
Problem: 
Engineers are frequently called on to inspect and, if necessary, repair equipment in 
medical hospitals. Suppose that the PET system malfunctions. After inspecting the unit, 
you suspect that one of the PET photon detectors is misaligned. To test your theory you 
position one detector at the location (r, 0, y) = (1.5, 45, 30) relative to a radioactive 


test sample at the center of the patient bed. (a) If the second photon detector is properly 
aligned where should it be located? (b) What energy reading is expected? 


Glossary 


high dose 
dose of radiation greater than 1 Sv (100 rem) 


low dose 


dose of radiation less than 100 mSv (10 rem) 


moderate dose 
dose of radiation from 0.1 Sv to 1 Sv (10 to 100 rem) 


positron emission tomography (PET) 
tomography technique that uses @* emitters and detects the two annihilation ¥ rays, 
aiding in source localization 


radiation dose unit (rad) 
ionizing energy deposited per kilogram of tissue 


radioactive tags 
special drugs (radiopharmaceuticals) that allow doctors to track movement of other 
drugs in the body 


radiopharmaceutical 
compound used for medical imaging 


relative biological effectiveness (RBE) 
number that expresses the relative amount of damage that a fixed amount of ionizing 
radiation of a given type can inflict on biological tissues 


roentgen equivalent man (rem) 
dose unit more closely related to effects in biological tissue 


sievert (Sv) 
SI equivalent of the rem 


single-photon-emission computed tomography (SPECT) 
tomography performed with 7-emitting radiopharmaceuticals 


Introduction 
class="introduction" 


The Large 
Hadron 
Collider 
(HG) is 

located over 

150 meters 

(500 feet) 

underground 
on the border 
of 

Switzerland 

and France 

near Geneva, 
Switzerland. 

The LHC is 
the most 
powerful 

machine ever 
developed to 
test our 
understandin 
g of 
elementary 
particle 
interactions. 
Shown here 
is the ATLAS 
detector, 
which helps 
identify new 
particles 
formed in 
collisions. 
(credit: 


modification 
of work by 
Maximilien 
Brice, 
CERN) 


At the very beginning of this text we discussed the wide range of scales that 
physics encompasses, from the very smallest particles to the largest scale 
possible—the universe itself. In this final chapter we examine some of the 
frontiers of research at these extreme scales. Particle physics deals with the 
most basic building blocks of matter and the forces that hold them together. 
Cosmology is the study of the stars, galaxies, and galactic structures that 
populate our universe, as well as their past history and future evolution. 


These two areas of physics are not as disconnected as you might think. The 
study of elementary particles requires enormous energies to produce 
isolated particles, involving some of the largest machines humans have ever 
built. But such high energies were present in the earliest stages of the 
universe and the universe we see around us today was shaped in part by the 
nature and interactions of the elementary particles created then. Bear in 
mind that particle physics and cosmology are both areas of intense current 
research, subject to much speculation on the part of physicists (as well 
science-fiction writers). In this chapter we try to emphasize what is known 


on the basis of deductions from observational evidence, and identify ideas 
that are conjectured but still unproven. 


Introduction to Particle Physics 
By the end of this section, you will be able to: 


e Describe the four fundamental forces and what particles participate in them 

e Identify and describe fermions and bosons 

¢ Identify and describe the quark and lepton families 

Distinguish between particles and antiparticles, and describe their interactions 


Elementary particle physics is the study of fundamental particles and their interactions in 
nature. Those who study elementary particle physics—the particle physicists—differ from 
other physicists in the scale of the systems that they study. A particle physicist is not content to 
study the microscopic world of cells, molecules, atoms, or even atomic nuclei. They are 
interested in physical processes that occur at scales even smaller than atomic nuclei. At the 
same time, they engage the most profound mysteries in nature: How did the universe begin? 
What explains the pattern of masses in the universe? Why is there more matter than antimatter 
in the universe? Why are energy and momentum conserved? How will the universe evolve? 


Four Fundamental Forces 


An important step to answering these questions is to understand particles and their interactions. 
Particle interactions are expressed in terms of four fundamental forces. In order of decreasing 
strength, these forces are the strong nuclear force, the electromagnetic force, the weak 
nuclear force, and the gravitational force. 


1. Strong nuclear force. The strong nuclear force is a very strong attractive force that acts 
only over very short distances (about 10~*° m). The strong nuclear force is responsible for 
binding protons and neutrons together in atomic nuclei. Not all particles participate in the 
strong nuclear force; for instance, electrons and neutrinos are not affected by it. As the 
name suggests, this force is much stronger than the other forces. 

2. Electromagnetic force. The electromagnetic force can act over very large distances (it has 
an infinite range) but is only 1/100 the strength of the strong nuclear force. Particles that 
interact through this force are said to have “charge.” In the classical theory of static 
electricity (Coulomb’s law), the electric force varies as the product of the charges of the 
interacting particles, and as the inverse square of the distances between them. In contrast 
to the strong force, the electromagnetic force can be attractive or repulsive (opposite 
charges attract and like charges repel). The magnetic force depends in a more complicated 
way on the charges and their motions. The unification of the electric and magnetic force 
into a single electromagnetic force (an achievement of James Clerk Maxwell) stands as 
one of the greatest intellectual achievements of the nineteenth century. This force is central 
to scientific models of atomic structure and molecular bonding. 

3. Weak nuclear force. The weak nuclear force acts over very short distances (10~*° m) 
and, as its name suggest, is very weak. It is roughly 10~° the strength of the strong nuclear 
force. This force is manifested most notably in decays of elementary particles and neutrino 
interactions. For example, the neutron can decay to a proton, electron, and electron 
neutrino through the weak force. The weak force is vitally important because it is essential 


for understanding stellar nucleosynthesis—the process that creates new atomic nuclei in 
the cores of stars. 

4. Gravitational force. Like the electromagnetic force, the gravitational force can act over 
infinitely large distances; however, it is only 10~°° as strong as the strong nuclear force. 
In Newton’s classical theory of gravity, the force of gravity varies as the product of the 
masses of the interacting particles and as the inverse square of the distance between them. 
This force is an attractive force that acts between all particles with mass. In modern 
theories of gravity, this force behavior is considered a special case for low-energy 
macroscopic interactions. Compared with the other forces of nature, gravity is by far the 
weakest. 


The fundamental forces may not be truly “fundamental” but may actually be different aspects 
of the same force. Just as the electric and magnetic forces were unified into an electromagnetic 
force, physicists in the 1970s unified the electromagnetic force with the weak nuclear force into 
an electroweak force. Any scientific theory that attempts to unify the electroweak force and 
strong nuclear force is called a grand unified theory, and any theory that attempts to unify all 
four forces is called a theory of everything. We will return to the concept of unification later in 
this chapter. 


Classifications of Elementary Particles 


A large number of subatomic particles exist in nature. These particles can be classified in two 
ways: the property of spin and participation in the four fundamental forces. Recall that the spin 
of a particle is analogous to the rotation of a macroscopic object about its own axis. These types 
of classification are described separately below. 


Classification by spin 


Particles of matter can be divided into fermions and bosons. Fermions have half-integral spin 
(5h, 2h, ...) and bosons have integral spin (Of, 1h, 2H, ...). Familiar examples of 
fermions are electrons, protons, and neutrons. A familiar example of a boson is a photon. 
Fermions and bosons behave very differently in groups. For example, when electrons are 
confined to a small region of space, Pauli’s exclusion principle states that no two electrons can 
occupy the same quantum-mechanical state. However, when photons are confined to a small 
region of space, there is no such limitation. 


The behavior of fermions and bosons in groups can be understood in terms of the property of 
indistinguishability. Particles are said to be “indistinguishable” if they are identical to one 
another. For example, electrons are indistinguishable because every electron in the universe has 
exactly the same mass and spin as all other electrons—“when you’ ve seen one electron, you’ve 
seen them all.” If you switch two indistinguishable particles in the same small region of space, 
the square of the wave function that describes this system and can be measured (|7)|”) is 
unchanged. If this were not the case, we could tell whether or not the particles had been 
switched and the particle would not be truly indistinguishable. Fermions and bosons differ by 
whether the sign of the wave function (w)— not directly observable—flips: 


Equation: 


y — —vw (indistinguishable fermions), 
yw — + (indistinguishable bosons). 


Fermions are said to be “antisymmetric on exchange” and bosons are “symmetric on 
exchange.” Pauli’s exclusion principle is a consequence of exchange symmetry of fermions— 
a connection developed in a more advanced course in modern physics. The electronic structure 
of atoms is predicated on Pauli’s exclusion principle and is therefore directly related to the 
indistinguishability of electrons. 


Classification by force interactions 


Fermions can be further divided into quarks and leptons. The primary difference between 
these two types of particles is that quarks interact via the strong force and leptons do not. 
Quarks and leptons (as well as bosons to be discussed later) are organized in [link]. The upper 
two rows (first three columns in purple) contain six quarks. These quarks are arranged into two 
particle families: up, charm, and top (u, c, t), and down, strange, and bottom (d, s, b). Members 
of the same particle family share the same properties but differ in mass (given in MeV / c’). For 
example, the mass of the top quark is much greater than the charm quark, and the mass of the 
charm quark is much greater than the up quark. All quarks interact with one another through 
the strong nuclear force. 
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The families of subatomic particles, categorized by the types of forces with which 
they interact. (credit: modification of work by “MissMJ”/Wikimedia Commons) 


Leptons 


Ordinary matter consists of two types of quarks: the up quark (elementary charge, g = +2/3) 
and the down quark (q¢ = —1/3). Heavier quarks are unstable and quickly decay to lighter ones 
via the weak force. Quarks bind together in groups of twos and threes called hadrons via the 
strong force. Hadrons that consist of two quarks are called mesons, and those that consist of 
three quarks are called baryons. Examples of mesons include the pion and kaon, and examples 
of baryons include the familiar proton and neutron. A proton is two up quarks and a down 
quark (p = uud, g = +1) and a neutron is one up quark and two down quarks (n = udd, 

q = 0). Properties of sample mesons and baryons are given in [link]. Quarks participate in all 
four fundamental forces: strong, weak, electromagnetic, and gravitational. 


The lower two rows in the figure (in green) contain six leptons arranged into two particle 
families: electron, muon, and tau (e, 4, 7), and electron neutrino, muon neutrino, and tau 
neutrino (Ve, Vy, Ur). The muon is over 200 times heavier than an electron, but is otherwise 
similar to the electron. The tau is about 3500 times heavier than the electron, but is otherwise 
similar to the muon and electron. Once created, the muon and tau quickly decay to lighter 
particles via the weak force. Leptons do not participate in the strong force. Quarks and leptons 


will be discussed later in this chapter. Leptons participate in the weak, electromagnetic, and 
gravitational forces, but do not participate in the strong force. 


Bosons (shown in red) are the force carriers of the fermions. In this model, leptons and quarks 
interact with each other by sending and receiving bosons. For example, Coulombic interaction 
occurs when two positively charged particles send and receive (exchange) photons. The 
photons are said to “carry” the force between charged particles. Likewise, attraction between 
two quarks in an atomic nucleus occurs when two quarks send and receive gluons. Additional 
examples include W and Z bosons (which carry weak nuclear force) and gravitons (which 
carry gravitational force). The Higgs boson is a special particle: When it interacts with other 
particles, it endows them not with force but with mass. In other words, the Higgs boson helps to 
explains why particles have mass. These assertions are part of a tentative but very productive 
scientific model (the Standard Model) discussed later. 


Particles and Antiparticles 


In the late 1920s, the special theory of relativity and quantum mechanics were combined into a 
relativistic quantum theory of the electron. A surprising result of this theory was the prediction 
of two energy states for each electron: One is associated with the electron, and the other is 
associated with another particle with the same mass of an electron but with a charge of e*. This 
particle is called the antielectron or positron. The positron was discovered experimentally in 
the 1930s. 


Soon it was discovered that for every particle in nature, there is a corresponding antiparticle. 
An antiparticle has the same mass and lifetime as its associated particle, and the opposite sign 
of electric charge. These particles are produced in high-energy reactions. Examples of high- 


energy particles include the antimuon (y:*), anti-up quark (w), and anti-down quark (d). (Note 
that antiparticles for quarks are designated with an over-bar.) Many mesons and baryons 


contain antiparticles. For example, the antiproton (p) is wud and the positively charged pion 


(x*) is ud. Some neutral particles, such as the photon and the z° meson, are their own 
antiparticles. Sample particles, antiparticles, and their properties are listed in [link]. 


Particle Mass Average 
name Symbol Antiparticle (MeV/c?) lifetime (s) 


Leptons 


Electron e” er 0.511 Stable 


Hadrons 


Baryons 


Mesons 


Particle 
name 


Electron 
neutrino 


Muon 


Muon 
neutrino 


Tau 


Tau 
neutrino 


Proton 
Neutron 
Lambda 


Sigma 


Jp 


Upsilon 


Symbol 


Ur 


Antiparticle 


Average 
lifetime (s) 


Stable 

2.20 x 10°6 
Stable 

<4 x 10°78 
Stable 

Stable 

920 

2.6 x 1071° 
0.80 x 10~1° 
2.9 x 1071° 
0.82 x 10710 
2.60 x 10-8 
0.83 x 10-16 
1.24 x 1078 
0.89 x 10~1° 
5.2 x 10-8 

(12X02 
1D 10-8 


Particles and their Properties 


The same forces that hold ordinary matter together also hold antimatter together. Under the 
right conditions, it is possible to create antiatoms such as antihydrogen, antioxygen, and even 
antiwater. In antiatoms, positrons orbit a negatively charged nucleus of antiprotons and 
antineutrons. [link] compares atoms and antiatoms. 
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Helium Antihelium 
(b) 


A comparison of the simplest atoms of matter and 
antimatter. (a) In the Bohr model, an antihydrogen atom 
consists of a positron that orbits an antiproton. (b) An 
antihelium atom consists of two positrons that orbit a 
nucleus of two antiprotons and two antineutrons. 


Antimatter cannot exist for long in nature because particles and antiparticles annihilate each 
other to produce high-energy radiation. A common example is electron-positron annihilation. 
This process proceeds by the reaction 

Equation: 


e tet — 24. 


The electron and positron vanish completely and two photons are produced in their place. (It 
turns out that the production of a single photon would violate conservation of energy and 
momentum.) This reaction can also proceed in the reverse direction: Two photons can 
annihilate each other to produce an electron and positron pair. Or, a single photon can produce 
an electron-positron pair in the field of a nucleus, a process called pair production. Reactions of 
this kind are measured routinely in modern particle detectors. The existence of antiparticles in 
nature is not science fiction. 


Note: 
Watch this video to learn more about matter and antimatter particles. 


Summary 


e The four fundamental forces of nature are, in order of strength: strong nuclear, 
electromagnetic, weak nuclear, and gravitational. Quarks interact via the strong force, but 
leptons do not. Both quark and leptons interact via the electromagnetic, weak, and 
gravitational forces. 

e Elementary particles are classified into fermions and boson. Fermions have half-integral 
spin and obey the exclusion principle. Bosons have integral spin and do not obey this 
principle. Bosons are the force carriers of particle interactions. 

¢ Quarks and leptons belong to particle families composed of three members each. Members 
of a family share many properties (charge, spin, participation in forces) but not mass. 

e All particles have antiparticles. Particles share the same properties as their antimatter 
particles, but carry opposite charge. 


Conceptual Questions 


Exercise: 


Problem: What are the four fundamental forces? Briefly describe them. 
Solution: 


Strong nuclear force: interaction between quarks, mediated by gluons. Electromagnetic 
force: interaction between charge particles, mediated photons. Weak nuclear force: 
interactions between fermions, mediated by heavy bosons. Gravitational force: 
interactions between material (massive) particle, mediate by hypothetical gravitons. 


Exercise: 
Problem: 


Distinguish fermions and bosons using the concepts of indistiguishability and exchange 
symmetry. 


Exercise: 


Problem: List the quark and lepton families 
Solution: 


electron, muon, tau; electron neutrino, muon neutrino, tau neutrino; down quark, strange 
quark, bottom quark; up quark, charm quark, top quark 


Exercise: 


Problem: 


Distinguish between elementary particles and antiparticles. Describe their interactions. 


Problems 


Exercise: 


Problem: 


How much energy is released when an electron and a positron at rest annihilate each 
other? (For particle masses, see [link].) 


Solution: 


1.022 MeV 
Exercise: 


Problem: 


If 1.0 x 10°? MeV of energy is released in the annihilation of a sphere of matter and 
antimatter, and the spheres are equal mass, what are the masses of the spheres? 


Exercise: 


Problem: 


When both an electron and a positron are at rest, they can annihilate each other according 
to the reaction 


e +et >y+y7. 
In this case, what are the energy, momentum, and frequency of each photon? 
Solution: 


0.511 MeV, 2.73 x 10-7? kg- m/s, 1.23 x 107° Hz 


Exercise: 


Problem: 


What is the total kinetic energy carried away by the particles of the following decays? 


Glossary 


antiparticle 
subatomic particle with the same mass and lifetime as its associated particle, but opposite 
electric charge 


baryons 
group of three quarks 


boson 
particle with integral spin that are symmetric on exchange 


electroweak force 
unification of electromagnetic force and weak-nuclear force interactions 


exchange symmetry 
property of a system of indistinguishable particles that requires the exchange of any two 
particles to be unobservable 


fermion 
particle with half-integral spin that is antisymmetric on exchange 


fundamental force 
one of four forces that act between bodies of matter: the strong nuclear, electromagnetic, 
weak nuclear, and gravitational forces 


gluon 
particle that that carry the strong nuclear force between quarks within an atomic nucleus 


grand unified theory 
theory of particle interactions that unifies the strong nuclear, electromagnetic, and weak 
nuclear forces 


hadron 
a meson or baryon 


lepton 


a fermion that participates in the electroweak force 


mesons 
a group of two quarks 


quark 
a fermion that participates in the electroweak and strong nuclear force 


positron 
antielectron 


strong nuclear force 
relatively strong attractive force that acts over short distances (about 10~1° m) responsible 
for binding protons and neutrons together in atomic nuclei 


theory of everything 
a theory of particle interactions that unifies all four fundamental forces 


W and Z boson 
particle with a relatively large mass that carries the weak nuclear force between leptons 
and quarks 


weak nuclear force 
relative weak force (about 10~° the strength of the strong nuclear force) responsible for 
decays of elementary particles and neutrino interactions 


Particle Conservation Laws 
By the end of this section, you will be able to: 


e Distinguish three conservation laws: baryon number, lepton number, and strangeness 

¢ Use rules to determine the total baryon number, lepton number, and strangeness of particles 
before and after a reaction 

¢ Use baryon number, lepton number, and strangeness conservation to determine if particle 
reactions or decays occur 


Conservation laws are critical to an understanding of particle physics. Strong evidence exists that 
energy, Momentum, and angular momentum are all conserved in all particle interactions. The 
annihilation of an electron and positron at rest, for example, cannot produce just one photon 
because this violates the conservation of linear momentum. As discussed in Relativity, the 
special theory of relativity modifies definitions of momentum, energy, and other familiar 
quantities. In particular, the relativistic momentum of a particle differs from its classical 


momentum by a factor y = 1/4/1— (v/c)? that varies from 1 to 00, depending on the speed of 
the particle. 


In previous chapters, we encountered other conservation laws as well. For example, charge is 
conserved in all electrostatic phenomena. Charge lost in one place is gained in another because 
charge is carried by particles. No known physical processes violate charge conservation. In the 
next section, we describe three less-familiar conservation laws: baryon number, lepton number, 
and strangeness. These are by no means the only conservation laws in particle physics. 


Baryon Number Conservation 


No conservation law considered thus far prevents a neutron from decaying via a reaction such as 
Equation: 


n—o>e'+e. 


This process conserves charge, energy, and momentum. However, it does not occur because it 
violates the law of baryon number conservation. This law requires that the total baryon number 
of a reaction is the same before and after the reaction occurs. To determine the total baryon 
number, every elementary particle is assigned a baryon number B. The baryon number has the 
value B = +1 for baryons, —1 for antibaryons, and 0 for all other particles. Returning to the 
above case (the decay of the neutron into an electron-positron pair), the neutron has a value 

B = +1, whereas the electron and the positron each has a value of 0. Thus, the decay does not 
occur because the total baryon number changes from 1 to 0. However, the proton-antiproton 
collision process 

Equation: 


ptp—>p+p+t+p+tp, 


does satisfy the law of conservation of baryon number because the baryon number is zero before 
and after the interaction. The baryon number for several common particles is given in [link]. 


Lepton Lepton Lepton Baryon Strange- 
Particle number number number number ness 
name Symbol (Le) (L,.) (L-) (B) number 
Electron e- 1 0 0 0 0 
Electron , 1 0 0 0 0 
neutrino . 
Muon i 0 1 0 0 0 
nae Vy, 0 1 0 0 0 
neutrino 
Tau Tr 0 0 1 0 0 
ae U, 0 0 1 0 0 
neutrino 
Pion wt 0 0 0 0 0 
Positive Kt 0 0 0 0 1 
kaon 
Negative K- 0 0 0 0 1 
kaon 
Proton p 0 0 0 1 0 
Neutron n 0 0 0 1 0 
Pannen | Ao 0 0 0 1 ai 
Zero 
Positive mt 0 0 0 1 1 
sigma 
Negative > 0 0 0 1 1 


sigma 


Lepton Lepton Lepton Baryon Strange- 


Particle number number number number ness 
name Symbol (Le) (L,,) (L-) (B) number 
Xi zero = 0 0 0 1 a5 
peeanve. Wee 0 0 0 1 5 

xi 

Omega Q- 0 0 0 1 3 


Conserved Properties of Particles 


Example: 

Baryon Number Conservation 

Based on the law of conservation of baryon number, which of the following reactions can occur? 
Equation: 


(a)a- +p nr 4+n4+a 427 
(b)p+p—>pt+ptp 


Strategy 

Determine the total baryon number for the reactants and products, and require that this value 
does not change in the reaction. Solution 

For reaction (a), the net baryon number of the two reactants is 0 + 1 = 1 and the net baryon 
number of the four products is 0 + 1 + 0 + 0 = 1. Since the net baryon numbers of the 
reactants and products are equal, this reaction is allowed on the basis of the baryon number 
conservation law. 

For reaction (b), the net baryon number of the reactants is 1 + (—1) = 0 and the net baryon 
number of the proposed products is 1 + 1 + (—1) = 1. Since the net baryon numbers of the 
reactants and proposed products are not equal, this reaction cannot occur. 

Significance 

Baryon number is conserved in the first reaction, but not in the second. Baryon number 
conservation constrains what reactions can and cannot occur in nature. 


Note: 
Exercise: 


Problem: Check Your Understanding What is the baryon number of a hydrogen nucleus? 


Solution: 


Lepton Number Conservation 


Lepton number conservation states that the sum of lepton numbers before and after the 
interaction must be the same. There are three different lepton numbers: the electron-lepton 
number L,, the muon-lepton number L,,, and the tau-lepton number L. In any interaction, each 
of these quantities must be conserved separately. For electrons and electron neutrinos, DL. = 1; 
for their antiparticles, Le = —1; all other particles have L, = 0. Similarly, L,, = 1 for muons 
and muon neutrinos, L,, = —1 for their antiparticles, and L,, = 0 for all other particles. Finally, 
L, = 1, —1, or 0, depending on whether we have a tau or tau neutrino, their antiparticles, or any 
other particle, respectively. Lepton number conservation guarantees that the number of electrons 
and positrons in the universe stays relatively constant. (Note: The total lepton number is, as far as 
we know, conserved in nature. However, observations have shown variations of family lepton 
number (for example, LZ.) in a phenomenon called neutrino oscillations.) 


To illustrate the lepton number conservation law, consider the following known two-step decay 
process: 


Equation: 
n> ptt yy 
pt oet+uRe+ Vy. 
In the first decay, all of the lepton numbers for 7* are 0. For the products of this decay, L, = —1 


for ut and L,, = 1 for v,. Therefore, muon-lepton number is conserved. Neither electrons nor 
tau are involved in this decay, so L, = 0 and L, = 0 for the initial particle and all decay 
products. Thus, electron-lepton and tau-lepton numbers are also conserved. In the second decay, 
py has a muon-lepton number L,, = —1, whereas the net muon-lepton number of the decay 
products is 0 + 0 + (—1) =—1. Thus, the muon-lepton number is conserved. Electron-lepton 
number is also conserved, as L, = 0 for y*, whereas the net electron-lepton number of the decay 
products is (—1) + 1+ 0 = 0. Finally, since no taus or tau-neutrinos are involved in this decay, 
the tau-lepton number is also conserved. 


Example: 

Lepton Number Conservation 

Based on the law of conservation of lepton number, which of the following decays can occur? 
Equation: 


(ja pee 
(a ee, 


Strategy 

Determine the total lepton number for the reactants and products, and require that this value 
does not change in the reaction. 

Solution 

For decay (a), the electron-lepton number of the neutron is 0, and the net electron-lepton number 
of the decay products is 0 + 1 + (—1) = 0. Since the net electron-lepton numbers before and 
after the decay are the same, the decay is possible on the basis of the law of conservation of 
electron-lepton number. Also, since there are no muons or taus involved in this decay, the muon- 
lepton and tauon-lepton numbers are conserved. 

For decay (b), the muon-lepton number of the 7” is 0, and the net muon-lepton number of the 
proposed decay products is 1 + 1 + (—1) = 1. Thus, on the basis of the law of conservation of 
muon-lepton number, this decay cannot occur. 

Significance 

Lepton number is conserved in the first reaction, but not in the second. Lepton number 
conservation constrains what reactions can and cannot occur in nature. 


Note: 
Exercise: 


Problem: 
Check Your Understanding What is the lepton number of an electron-positron pair? 


Solution: 


0 


Strangeness Conservation 


In the late 1940s and early 1950s, cosmic-ray experiments revealed the existence of particles that 
had never been observed on Earth. These particles were produced in collisions of pions with 
protons or neutrons in the atmosphere. Their production and decay were unusual. They were 
produced in the strong nuclear interactions of pions and nucleons, and were therefore inferred to 
be hadrons; however, their decay was mediated by the much more slowly acting weak nuclear 
interaction. Their lifetimes were on the order of 10-1? to 10-8 s, whereas a typical lifetime for a 
particle that decays via the strong nuclear reaction is 10° s. These particles were also unusual 
because they were always produced in pairs in the pion-nucleon collisions. For these reasons, 
these newly discovered particles were described as strange. The production and subsequent 
decay of a pair of strange particles is illustrated in [link] and follows the reaction 

Equation: 


nx +p—>A°+K°®. 


The lambda particle then decays through the weak nuclear interaction according to 
Equation: 


A° + x +p, 


and the kaon decays via the weak interaction 
Equation: 


K9 sattn. 


(b) 


The interactions of hadrons. (a) Bubble chamber photograph; (b) sketch that 


represents the photograph. 


To rationalize the behavior of these strange particles, particle physicists invented a particle 
property conserved in strong interactions but not in weak interactions. This property is called 
strangeness and, as the name suggests, is associated with the presence of a strange quark. The 
strangeness of a particle is equal to the number of strange quarks of the particle. Strangeness 
conservation requires the total strangeness of a reaction or decay (summing the strangeness of all 
the particles) is the same before and after the interaction. Strangeness conservation is not 
absolute: It is conserved in strong interactions and electromagnetic interactions but not in weak 
interactions. The strangeness number for several common particles is given in [Link]. 


Example: 

Strangeness Conservation 

(a) Based on the conservation of strangeness, can the following reaction occur? 
Equation: 


he ei 


(b) The following decay is mediated by the weak nuclear force: 
Equation: 


KS 


Does the decay conserve strangeness? If not, can the decay occur? 

Strategy 

Determine the strangeness of the reactants and products and require that this value does not 
change in the reaction. 

Solution 


a. The net strangeness of the reactants is 0 + 0 = 0, and the net strangeness of the products is 
1+ (—1) + 0 =0. Thus, the strong nuclear interaction between a pion and a proton is not 
forbidden by the law of conservation of strangeness. Notice that baryon number is also 
conserved in the reaction. 

b. The net strangeness before and after this decay is 1 and 0, so the decay does not conserve 
strangeness. However, the decay may still be possible, because the law of conservation of 
strangeness does not apply to weak decays. 


Significance 
Strangeness is conserved in the first reaction, but not in the second. Strangeness conservation 
constrains what reactions can and cannot occur in nature. 


Note: 


Exercise: 


Problem: Check Your Understanding What is the strangeness number of a muon? 


Solution: 


0 


Summary 
e Elementary particle interactions are governed by particle conservation laws, which can be 
used to determine what particle reactions and decays are possible (or forbidden). 
e The baryon number conservation law and the three lepton number conversation law are 


valid for all physical processes. However, conservation of strangeness is valid only for 
strong nuclear interactions and electromagnetic interactions. 


Conceptual Questions 


Exercise: 


Problem: What are six particle conservation laws? Briefly describe them. 
Solution: 
Conservation energy, momentum, and charge (familiar to classical and relativistic 


mechanics). Also, conservation of baryon number, lepton number, and strangeness— 
numbers that do not change before and after a collision or decay. 


Exercise: 


Problem: In general, how do we determine if a particle reaction or decay occurs? 
Exercise: 


Problem: 


Why might the detection of particle interaction that violates an established particle 
conservation law be considered a good thing for a scientist? 


Solution: 


It means that the theory that requires the conservation law is not understood. The failure of a 
long-established theory often leads to a deeper understanding of nature. 


Problems 


Exercise: 


Problem: 


Which of the following decays cannot occur because the law of conservation of lepton 
number is violated? 


c) mt et + vet Up 


Solution: 


a, b, andc 
Exercise: 
Problem: 


Which of the following reactions cannot because the law of conservation of strangeness is 
violated? 


(a)ptn>pt+pta 
(b)pt+n—>p+p+K™ 
(ce) K tps kK 45% 
(d)m7 +p7>K*+ 5>— 


Exercise: 


Problem: Identify one possible decay for each of the following antiparticles: 


(a) n, (b) A°, (c) Ot, (d) K~, and (e) E. 


Solution: 


a. p.' ve; b. pr* or pr; c. E°n® or ACKT; d. wv, or 77° 


7. pr or nT 
Exercise: 
Problem: 


Each of the following strong nuclear reactions is forbidden. Identify a conservation law that 
is violated for each one. 


p>p+n+p 


(a) p 
b)p+iSp+p+a+a 
(c) 7 

( 


c)am +p L++K~ 
d)K +p A°+n 


Glossary 


baryon number 
baryon number has the value B = +1 for baryons, — 1 for antibaryons, and 0 for all other 
particles and is conserved in particle interactions 


lepton number 
electron-lepton number L,, the muon-lepton number L,,, and the tau-lepton number L, are 
conserved separately in every particle interaction 


strangeness 
particle property associated with the presence of a strange quark 


Quarks 
By the end of this section, you will be able to: 


e Compare and contrast the six known quarks 

e Use quark composition of hadrons to determine the total charge of these 
particles 

e Explain the primary evidence for the existence of quarks 


In the 1960s, particle physicists began to realize that hadrons are not elementary 
particles but are made of particles called quarks. (The name ‘quark’ was coined 
by the physicist Murray Gell-Mann, from a phrase in the James Joyce novel 
Finnegans Wake.) Initially, it was believed there were only three types of 
quarks, called up (u), down (d), and strange (s). However, this number soon 
grew to six—interestingly, the same as the number of leptons—to include 
charmed (c), bottom (b), and top (t). 


All quarks are spin-half fermions (s = 1/2), have a fractional charge 

(1/3 or 2/3e), and have baryon number B = 1/3. Each quark has an antiquark 
with the same mass but opposite charge and baryon number. The names and 
properties of the six quarks are listed in [link]. 


Charge Spin Baryon Strangeness 
Quark (units of e) (s) number number 
Down 
—1/3 1/2 1/3 0 
(a) / 
Up (u) +2/3 1/2 1/3 0 
Strange 
—1/3 1/2 1/3 —1 
6 / 
ae | 458 1/2 1/3 0 


(c) 


Charge Spin Baryon Strangeness 


Quark (units of e) (s) number number 
Bottom 
—1/3 1/2 1/3 0 
(b) / 
Top (t) +2/3 1/2 1/3 0 
Quarks 


Quark Combinations 


As mentioned earlier, quarks bind together in groups of two or three to form 
hadrons. Baryons are formed from three quarks. Sample baryons, including 
quark content and properties, are given in [link]. Interestingly, the delta plus ( 
A*) baryon is formed from the same three quarks as the proton, but the total 
spin of the particle is 3/2 rather than 1/2. Similarly, the mass of A‘ with spin 
3/2 is 1.3 times the mass of the proton, and the delta zero (A°) baryon witha 
spin 3/2 is 1.3 times the neutron mass. Evidently, the energy associated with the 
spin (or angular momentum) of the particle contributes to its mass energy. It is 
also interesting that no baryons are believed to exist with top quarks, because 
top quarks decay too quickly to bind to the other quarks in their production. 


Charge Mass ( 
(unit of Spin GeV/c? 
Name Symbol Quarks e) (s) ) 
Proton ) uud i 1/2 0.938 
Neutron n udd 0 1/2 0.940 
Det = uuu 2 3/2 1.232 


plus plus 


Name 


Delta 
plus 


Delta 
Zero 


Delta 
minus 


Lambda 
zero 


Positive 
sigma 


Neutral 
sigma 


Negative 
xi 


Neutral 
xi 


Omega 
minus 


Charmed 
lambda 


Charmed 
bottom 


Baryon Quarks 


Symbol 


A? 


yt 


Quarks 


uud 


udd 


ddd 


uds 


uus 


uds 


sds 


SUS 


SSS 


udc 


udb 


Charge 
(unit of 


e) 


Spin 
(s) 


3/2 


3/2 


3/2 


1/2 


1/2 


1/2 


1/2 


1/2 


3/2 


1/2 


1/2 


1.232 


1.232 


1.116 


1.189 


1.192 


1.321 


1.315 


1.672 


2.281 


5.619 


Mesons are formed by two quarks—a quark-antiquark pair. Sample mesons, 
including quark content and properties, are given in [link]. Consider the 


formation of the pion (7+ = ud). Based on its quark content, the charge of the 
pion is 
Equation: 


24 
3° a 


Both quarks are spin-half (s = %), so the resultant spin is either 0 or 1. The spin 
of the 7* meson is 0. The same quark-antiquark combination gives the rho (p) 
meson with spin 1. This meson has a mass approximately 5.5 times that of the 
a? meson. 


Example: 

Quark Structure 

Show that the quark composition given in [link] for 2° is consistent with the 
known charge, spin, and strangeness of this baryon. 

Strategy 

=° is composed of two strange quarks and an up quark (s us). We can add 
together the properties of quarks to predict the resulting properties of the =° 
baryon. 

Solution 

The charge of the s quark is —e/3 and the charge of the u quark is 2e/3. Thus, 
the combination (s u s) has no net charge, in agreement with the known charge 
of =”. Since three spin —1 /2 quarks can combine to produce a particle with 
spin of either 1/2 or 3/2, the quark composition is consistent with the known 
spin (s = 1/2) of &°. Finally, the net strangeness of the (s u s) combination is 
(—1) + 0+ (—1) = —2, which also agrees with experiment. 

Significance 

The charge, spin, and strangeness of the =? particle can be determined from the 
properties of its constituent quarks. The great diversity of baryons and mesons 
can be traced to the properties of just six quarks: up, down, charge, strange, top, 
and bottom. 


Note: 
Exercise: 


Problem: 


Check Your Understanding What is the baryon number of a pion? 


Solution: 
0 

Charge Mass 
Name Symbol Quarks (e) Spin (GeV/c?) 
Positive xt ~ 1 0 0.140 
pion ud 
Positive e - 1 1 0.768 
rho i ud 
Negative _ = = 
pion 7 ud 1 0 0.140 
Negative _ _ 7 
ne p ud 1 1 0.768 
Se 7? wu OF 0 0 ee 

ion dd 

Neutral 7 uu, dd 0 0 0.547 
eta 


Name 


Positive 
kaon 


Neutral 
kaon 


Negative 
kaon 


J/Psi 


Charmed 
eta 


Neutral 
D 


Neutral 
D 


Positive 
D 


Neutral 
B 


Upsilon 
Meson Quarks 


Color 


J/w 


Db? 


Dt 


B? 


Quarks 


Spin 


Mass 
(GeV/c?) 


0.494 


0.498 


2.98 


1.86 


2.01 


1.87 


5.26 


9.46 


Quarks are fermions that obey Pauli’s exclusion principle, so it might be 
surprising to learn that three quarks can bind together within a nucleus. For 


example, how can two up quarks exist in the same small region of space within 
a proton? The solution is to invent a third new property to distinguish them. 
This property is called color, and it plays the same role in the strong nuclear 
interaction as charge does in electromagnetic interactions. For this reason, quark 
color is sometimes called “strong charge.” 


Quarks come in three colors: red, green, and blue. (These are just labels— 
quarks are not actually colored.) Each type of quark (u, d, c, s, b, t) can possess 
any other colors. For example, three strange quarks exist: a red strange quark, a 
green strange quark, and a blue strange quark. Antiquarks have anticolor. 
Quarks that bind together to form hadrons (baryons and mesons) must be color 
neutral, colorless, or “white.” Thus, a baryon must contain a red, blue, and green 
quark. Likewise, a meson contains either a red-antired, blue-antiblue, or green- 
antigreen quark pair. Thus, two quarks can be found in the same spin state in a 
hadron, without violating Pauli’s exclusion principle, because their colors are 
different. 


Quark Confinement 


The first strong evidence for the existence of quarks came from a series of 
experiments performed at the Stanford Linear Accelerator Center (SLAC) and at 
CERN around 1970. This experiment was designed to probe the structure of the 
proton, much like Rutherford studied structure inside the atom with his a- 
particle scattering experiments. Electrons were collided with protons with 
energy in excess of 20 GeV. At this energy, EF ~ pc, so the de Broglie 
wavelength of an electron is 

Equation: 


h 
= == 6 x 10727 m. 


The wavelength of the electron is much smaller than the diameter of the proton 
(about 10~!° m). Thus, like an automobile traveling through a rocky mountain 
range, electrons can be used to probe the structure of the nucleus. 


The SLAC experiments found that some electrons were deflected at very large 
angles, indicating small scattering centers within the proton. The scattering 
distribution was consistent with electrons being scattered from sites with spin 


1/2, the spin of quarks. The experiments at CERN used neutrinos instead of 
electrons. This experiment also found evidence for the tiny scattering centers. In 
both experiments, the results suggested that the charges of the scattering 
particles were either +2/3e or —1/3e, in agreement with the quark model. 


Note: 
Watch this video to learn more about quarks. 


The quark model has been extremely successful in organizing the complex 
world of subatomic particles. Interestingly, however, no experiment has ever 
produced an isolated quark. All quarks have fractional charge and should 
therefore be easily distinguishable from the known elementary particles, whose 
charges are all an integer multiple of e. Why are isolated quarks not observed? 
In current models of particle interactions, the answer is expressed in terms of 
quark confinement. Quark confinement refers to the confinement of quarks in 
groups of two or three in a small region of space. The quarks are completely 
free to move about in this space, and send and receive gluons (the carriers of the 
strong force). However, if these quarks stray too far from one another, the strong 
force pulls them back it. This action is likened to a bola, a weapon used for 
hunting ([{link]). The stones are tied to a central point by a string, so none of the 
rocks can move too far from the others. The bola corresponds to a baryon, the 
stones correspond to quarks, and the string corresponds to the gluons that hold 
the system together. 


Quark confinement 


Quarks 


A baryon is analogous to a bola, a weapon 
used for hunting. The rocks in this image 
correspond to the baryon quarks. The 
quarks are free to move about but must 
remain close to the other quarks. 


Summary 


e Six known quarks exist: up (u), down (d), charm (c), strange (s), top (0), 
and bottom (b). These particles are fermions with half-integral spin and 
fractional charge. 

¢ Baryons consist of three quarks, and mesons consist of a quark-antiquark 
pair. Due to the strong force, quarks cannot exist in isolation. 

e Evidence for quarks is found in scattering experiments. 


Conceptual Questions 


Exercise: 


Problem: What are the six known quarks? Summarize their properties. 
Exercise: 

Problem: What is the general quark composition of a baryon? Of a meson? 

Solution: 

3 quarks, 2 quarks (a quark-antiquark pair) 


Exercise: 


Problem: What evidence exists for the existence of quarks? 
Exercise: 


Problem: 


Why do baryons with the same quark composition sometimes differ in their 
rest mass energies? 


Solution: 


Baryons with the same quark composition differ in rest energy because this 
energy depends on the internal energy of the quarks (m =e Lg c’). So, a 
baryon that contains a quark with a large angular momentum is expected to 


be more massive than the same baryon with less angular momentum. 
Problems 


Exercise: 


Problem: 
Based on quark composition of a proton, show that its charge is +1. 


Solution: 


A proton consists of two up quarks and one down quark. The total charge 
of a proton is therefore + ; + ; +— ; = +1. 


Exercise: 


Problem: 


Based on the quark composition of a neutron, show that is charge is 0. 
Exercise: 
Problem: 


Argue that the quark composition given in [link] for the positive kaon is 
consistent with the known charge, spin, and strangeness of this baryon. 


Solution: 


The Kt meson is composed of an up quark and a strange antiquark (us). 
Since the changes of this quark and antiquark are 2e/3 and e/3, 
respectively, the net charge of the K* meson is e, in agreement with its 
known value. Two spin —1/2 particles can combine to produce a particle 
with spin of either 0 or 1, consistent with the K* meson’s spin of 0. The 
net strangeness of the up quark and strange antiquark is 0 + 1 = 1, in 
agreement with the measured strangeness of the K* meson. 


Exercise: 


Problem: 


Mesons are formed from the following combinations of quarks (subscripts 


indicate color and AR = antired): (dp, d AR); (Sq; UAG), and (SR, SAR): 


(a) Determine the charge and strangeness of each combination. (b) Identify 
one or more mesons formed by each quark-antiquark combination. 


Exercise: 


Problem: Why can’t either set of quarks shown below form a hadron? 


(a) 
(b) 


Solution: 


a. color; b. quark-antiquark 
Exercise: 
Problem: 
Experimental results indicate an isolate particle with charge +2/3—an 


isolated quark. What quark could this be? Why would this discovery be 
important? 


Exercise: 


Problem: 
Express the 8 decaysn > p+e  +vandp—>n-+e+t-+v in terms of 8 


decays of quarks. Check to see that the conservation laws for charge, 
lepton number, and baryon number are satisfied by the quark 6 decays. 


Solution: 


d>ut+e +vjurdtett+ue 


Glossary 


color 
property of particles and that plays the same role in strong nuclear 
interactions as electric charge does in electromagnetic interactions 


Particle Accelerators and Detectors 
By the end of this section, you will be able to: 


¢ Compare and contrast different types of particle accelerators 

e Describe the purpose, components, and function of a typical colliding 
beam machine 

e Explain the role of each type of subdetector of a typical multipurpose 
particle detector 

e Use the curvature of a charge track to determine the momentum of a 
particle 


The goal of experimental particle physics is to accurately measure 
elementary particles. The primary method used to achieve this end is to 
produce these particles in high-energy collisions and then measure the 
products of using highly sensitive particle detectors. These experiments are 
used to test and revise scientific models of particle interactions. The 
purpose of this section is to describe particle accelerators and detectors. 
Modern machines are based on earlier ones, so it is helpful to present a brief 
history of accelerators and detectors. 


Early Particle Accelerators 


A particle accelerator is a machine designed to accelerate charged 
particles. This acceleration is usually achieved with strong electric fields, 
magnetic fields, or both. A simple example of a particle accelerator is the 
Van de Graaff accelerator (see Electric Potential). This type of accelerator 
collects charges on a hollow metal sphere using a moving belt. When the 
electrostatic potential difference of the sphere is sufficiently large, the field 
is used to accelerate particles through an evacuated tube. Energies produced 
by a Van de Graaff accelerator are not large enough to create new particles, 
but the machine was important for early exploration of the atomic nucleus. 


Larger energies can be produced by a linear accelerator (called a “linac’). 
Charged particles produced at the beginning of the linac are accelerated by 
a continuous line of charged hollow tubes. The voltage between a given pair 
of tubes is set to draw the charged particle in, and once the particle arrives, 
the voltage between the next pair of tubes is set to push the charged particle 


out. In other words, voltages are applied in such a way that the tubes deliver 
a series of carefully synchronized electric kicks ({link]). Modern linacs 
employ radio frequency (RF) cavities that set up oscillating electromagnetic 
fields, which propel the particle forward like a surfer on an ocean wave. 
Linacs can accelerate electrons to over 100 MeV. (Electrons with kinetic 
energies greater than 2 MeV are moving very close to the speed of light.) In 
modern particle research, linear accelerators are often used in the first stage 
of acceleration. 


Particle beam 


Particle accelerates 
between tubes 


In a linear accelerator, charged tubes accelerate particles in a series of 
electromagnetic kicks. Each tube is longer than the preceding tube 
because the particle is moving faster as it accelerates. 


Example: 

Accelerating Tubes 

A linear accelerator designed to produce a beam of 800-MeV protons has 
2000 accelerating tubes separated by gaps. What average voltage must be 
applied between tubes to achieve the desired energy? (Hint: U = qV. ) 
Strategy 

The energy given to the proton in each gap between tubes is U = qV, 
where q is the proton’s charge and V is the potential difference (voltage) 
across the gap. Since g = qe = 1.6 x 101°C and 

Levi (0) (1.6 Sule ene the proton gains 1 eV in energy for each 
volt across the gap that it passes through. The ac voltage applied to the 
tubes is timed so that it adds to the energy in each gap. The effective 
voltage is the sum of the gap voltages and equals 800 MV to give each 
proton an energy of 800 MeV. 

Solution 

There are 2000 gaps and the sum of the voltages across them is 800 MV. 
Therefore, the average voltage applied is 0.4 MV or 400 kV. 
Significance 

A voltage of this magnitude is not difficult to achieve in a vacuum. Much 
larger gap voltages would be required for higher energy, such as those at 
the 50-GeV SLAC facility. Synchrotrons are aided by the circular path of 
the accelerated particles, which can orbit many times, effectively 
multiplying the number of accelerations by the number of orbits. This 
makes it possible to reach energies greater than 1 TeV. 


Note: 
Exercise: 


Problem: 


Check Your Understanding How much energy does an electron 
receive in accelerating through a 1-V potential difference? 


Solution: 


1leV 


The next-generation accelerator after the linac is the cyclotron (({link]). A 
cyclotron uses alternating electric fields and fixed magnets to accelerate 
particles in a circular spiral path. A particle at the center of the cyclotron is 
first accelerated by an electric field in a gap between two D-shaped magnets 
(Dees). As the particle crosses over the D-shaped magnet, the particle is 
bent into a circular path by a Lorentz force. (The Lorentz force was 
discussed in Magnetic Forces and Fields.) Assuming no energy losses, the 
momentum of the particle is related to its radius of curvature by 


Note: 
Equation: 


p=0.3Br 


where p is the momentum in GeV/c, B is in teslas, and r is the radius of the 
trajectory (“orbit”) in meters. This expression is valid to classical and 
relativistic velocities. The circular trajectory returns the particle to the 
electric field gap, the electric field is reversed, and the process continues. 
As the particle is accelerated, the radius of curvature gets larger and larger 
—spirally outward—until the electrons leave the device. 


External beam 


Cyclotrons use a magnetic field to cause particles 
to move in circular orbits. As the particles pass 
between the plates of the “Dees,” the voltage 
across the gap is reversed so the particles are 
accelerated twice in each orbit. 


Note: 
Watch this video to learn more about cyclotrons. 


A synchrotron is a circular accelerator that uses alternating voltage and 
increasing magnetic field strength to accelerate particles to higher energies. 
Charged particles are accelerated by RF cavities, and steered and focused 
by magnets. RF cavities are synchronized to deliver “kicks” to the particles 
as they pass by, hence the name. Steering high-energy particles requires 
strong magnetic fields, so superconducting magnets are often used to reduce 
heat losses. As the charged particles move in a circle, they radiate energy: 


According to classical theory, any charged particle that accelerates (and 
circular motion is an accelerated motion) also radiates. In a synchrotron, 
such radiation is called synchrotron radiation. This radiation is useful for 
many other purposes, such as medical and materials research. 


Example: 

The Energy of an Electron in a Cyclotron 

An electron is accelerated using a cyclotron. If the magnetic field is 1.5 T 
and the radius of the “Dees” is 1.2 m, what is the kinetic energy of the 
outgoing particle? 

Strategy 

If the radius of orbit of the electron exceeds the radius of the “Dees,” the 
electron exits the device. So, the radius of the “Dees” places an upper limit 
on the radius and, therefore, the momentum and energy of the accelerated 
particle. The exit momentum of the particle is determined using the radius 
of orbit and strength of the magnetic field. The exit energy of the particle 
can be determined the particle momentum (Relativity). 

Solution 

Assuming no energy losses, the momentum of the particle in the cyclotron 
is 

Equation: 


p = 0.3Br = 0.3(1.5 T)(1.2 m) = 0.543 GeV/c. 


The momentum energy pce? = 0.543 GeV = 543 MeV is much larger 
than the rest mass energy of the electron, mc”? = 0.511 MeV, so 
relativistic expression for the energy of the electron must be used (see 
Relativity). The total energy of the electron is 


Equation: 
Ia — 4V) || pe)’ + (me?)* = / (543)? + (0.511) 543)? + (0. (0.511) ~ 543 MeV and 
Equation: 


K = Etotal — mc? = 543 GeV — 0.511 GeV ~ 543 MeV. 


Significance 

The total energy of the electron is much larger than its rest mass energy. In 
other words, the total energy of the electron is almost all in the form of 
kinetic energy. Cyclotrons can be used to conduct nuclear physics 
experiments or in particle therapy to treat cancer. 


Note: 
Exercise: 


Problem: 


Check Your Understanding A charged particle of a certain 
momentum travels in an arc through a uniform magnetic field. What 
happens if the magnetic field is doubled? 


Solution: 


The radius of the track is cut in half. 


Colliding Beam Machines 


New particles can be created by colliding particles at high energies. 
According to Einstein’s mass-energy relation, the energies of the colliding 
particles are converted into mass energy of the created particle. The most 
efficient way to do this is with particle-colliding beam machines. A 
colliding beam machine creates two counter-rotating beams in a circular 
accelerator, stores the beams at constant energy, and then at the desired 
moment, focuses the beams on one another at the center of a sensitive 
detector. 


The prototypical colliding beam machine is the Cornell Electron Storage 
Ring, located in Ithaca, New York ({link]). Electrons (e~ ) and positrons (e* 
) are created at the beginning of the linear accelerator and are accelerated up 
to 150 MeV. The particles are then injected into the inner synchrotron ring, 


where they are accelerated by RF cavities to 4.5 to 6 GeV. When the beams 
are up to speed, they are transferred and “stored” in an outer storage ring at 
the same energy. The two counter-rotating beams travel through the same 
evacuated pipe, but are kept apart until collisions are desired. The electrons 
and positrons circle the machine in bunches 390,000 times every second. 
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The Cornell Electron Storage Ring uses a linear accelerator and a 
synchrotron to accelerate electrons and positrons to 4.5—6 GeV. The 
particles are held in the outer storage ring at that energy until they 
are made to collide in a particle detector. (credit: modification of 
work by Laboratory of Nuclear Studies, Cornell Electron Storage 
Ring) 


When an electron and positron collide, they annihilate each other to 
produce a photon, which exists for too short a time to be detected. The 
photon produces either a lepton pair (e.g., an electron and position, muon or 
antimuon, or tau and antitau) or a quark pair. If quarks are produced, 


mesons form, such as cc and bb. These mesons are created nearly at rest 
since the initial total momentum of the electron-positron system is zero. 
Note, mesons cannot be created at just any colliding energy but only at 
“resonant” energies that correspond to the unique masses of the mesons 
({link]). The mesons created in this way are highly unstable and decay 
quickly into lighter particles, such as electrons, protons, and photons. The 
collision “fragments” provide valuable information about particle 
interactions. 


As the field of particle physics advances, colliding beam machines are 
becoming more powerful. The Large Hadron Collider (LHC), currently the 
largest accelerator in the world, collides protons at beam energies exceeding 
6 TeV. The center-of-mass energy (W) refers to the total energy available to 
create new particles in a colliding machine, or the total energy of incoming 
particles in the center-of-mass frame. (The concept of a center-of-mass 
frame of reference is discussed in Linear Momentum and Collisions.) 
Therefore, the LHC is able to produce one or more particles with a total 
mass exceeding 12 TeV. The center-of-mass energy is given by: 


Note: 
Equation: 


W? = 2[E, E> + (pic) (pac) + (mic2)” + (myec?)’, 


where & and are the total energies of the incoming particles (1 and 2), 
1 and po are the magnitudes of their momenta, and m , and mz are their 
rest masses. 


Example: 
Creating a New Particle 


The mass of the upsilon (1) meson (6b) is created in a symmetric electron- 
positron collider. What beam energy is required? 

Strategy 

The Particle Data Group has stated that the rest mass energy of this meson 
is approximately 10.58 GeV. The above expression for the center-of-mass 
energy can be simplified because a symmetric collider implies p; = —py5. 
Also, the rest masses of the colliding electrons and positrons are identical 
(Gree? 0.511 MeV) and much smaller than the mass of the energy 
particle created. Thus, the center-of-mass energy (W) can be expressed 
completely in terms of the beam energy, Fiean = £, = Er. 

Solution 

Based on the above assumptions, we have 

Equation: 


W? = 2[E, EF, + EE] = 4E,E, = 4E?. 


The beam energy is therefore 
Equation: 


W 
Feam © Ey = 9° 


The rest mass energy of the particle created in the collision is equal to the 
center-of-mass energy, SO 
Equation: 


10.58 GeV 
Beam & ——-— 2 = ROO ERNE. 


Significance 

Given the energy scale of this problem, the rest mass energy of the upsilon 
( Y) meson is due almost entirely due to the initial kinetic energies of the 
electron and positrons. This meson is highly unstable and quickly decays to 
lighter and more stable particles. The existence of the upsilon (7) particle 
appears as a dramatic increase of such events at 5.29 GeV. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Why is a symmetric collider 
“symmetric?” 


Solution: 


The colliding particles have identical mass but opposite vector 
momenta. 


Higher beam energies require larger accelerators, so modem colliding beam 
machines are very large. The LHC, for example, is 17 miles in 
circumference ([link]). (In the 1940s, Enrico Fermi envisioned an 
accelerator that encircled all of Earth!) An important scientific challenge of 
the twenty-first century is to reduce the size of particle accelerators. 


Particle Detectors 


The purpose of a particle detector is to accurately measure the outcome of 
collisions created by a particle accelerator. The detectors are multipurpose. 
In other words, the detector is divided into many subdetectors, each 
designed to measure a different aspect of the collision event. For example, 
one detector might be designed to measure photons and another might be 


designed to measure muons. To illustrate how subdetectors contribute to an 
understanding of an entire collision event, we describe the subdetectors of 
the Compact Muon Solenoid (CMS), which was used to discover the Higgs 
Boson at the LHC ((link]). 
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Compact Muon Solenoid detector. The detector consists of several 
layers, each responsible for measuring different types of particles. 
(credit: modification of work by David Barney/CERN) 


The beam pipe of the detector is out of (and into) the page at the left. 
Particles produced by pp collisions (the “collision fragments”) stream out of 
the detector in all directions. These particles encounter multiple layers of 
subdetectors. A subdetector is a particle detector within a larger system of 
detectors designed to measure certain types of particles. There are several 
main types of subdetectors. Tracking devices determine the path and 


therefore momentum of a particle; calorimeters measure a particle’s energy; 
and particle-identification detectors determine a particle’s identity (mass). 


The first set of subdetectors that particles encounter is the silicon tracking 
system. This system is designed to measure the momentum of charged 
particles (such as electrons and protons). The detector is bathed in a 
uniform magnetic field, so the charged particles are bent in a circular path 
by a Lorentz force (as for the cyclotron). If the momentum of the particle is 
large, the radius of the trajectory is large, and the path is almost straight. 
But if the momentum is small, the radius of the trajectory is small, and the 
path is tightly curved. As the particles pass through the detector, they 
interact with silicon microstrip detectors at multiple points. These detectors 
produce small electrical signals as the charged particles pass near the 
detector elements. The signals are then amplified and recorded. A series of 
electrical “hits” is used to determine the trajectory of the particle in the 
tracking system. A computer-generated “best fit” to this trajectory gives the 
track radius and therefore the particle momentum. At the LHC, a large 
number of tracks are recorded for the same collision event. Fits to the tracks 
are shown by the blue and green lines in [Link]. 


A three-dimensional view of a heavy-ion collision event in 
the LHC as seen by the ALICE detector. (credit: 
LHC/CERN) 


Beyond the tracking layers is the electromagnetic calorimeter. This detector 
is made of clear, lead-based crystals. When electrons interact with the 
crystals, they radiate high-energy photons. The photons interact with the 
crystal to produce electron-positron pairs. Then, these particles radiate more 
photons. The process repeats, producing a particle shower (the crystal 
“glows”). A crude model of this process is as follows. 


An electron with energy £o strikes the crystal and loses half of its energy in 
the form of a photon. The photon produces an electron-positron pair, and 
each particle proceeds away with half the energy of the photon. Meanwhile, 


the original electron radiates again. So, we are left with four particles: two 
electrons, one positron, and one photon, each with an energy Ey /4. The 
number of particles in the shower increases geometrically. After n radiation 
events, there are VV = 2” particles. Hence, the total energy per particle after 
n radiation events is 

Equation: 


E(t) =>, 


where £ is the incident energy and E(t) is the amount of energy per 
particle after n events. An incoming photon triggers a similar chain of 
events ([link]). If the energy per particle drops below a particular threshold 
value, other types of radiative processes become important and the particle 
shower ceases. Eventually, the total energy of the incoming particle is 
absorbed and converted into an electrical signal. 


(a) (b) 


(a) A particle shower produced in a crystal calorimeter. (b) A diagram 
showing a typical sequence of reactions in a particle shower. 


Beyond the crystal calorimeter is the hadron calorimeter. As the name 
suggests, this subdetector measures hadrons such as protons and pions. The 


hadron calorimeter consists of layers of brass and steel separated by plastic 
scintillators. Its purpose is to absorb the particle energy and convert it into 
an electronic signal. Beyond this detector is a large magnetic coil used to 
produce a uniform field for tracking. 


The last subdetector is the muon detector, which consists of slabs of iron 
that only muons (and neutrinos) can penetrate. Between the iron slabs are 
multiple types of muon-tracking elements that accurately measure the 
momentum of the muon. The muon detectors are important because the 
Higgs boson (discussed soon) can be detected through its decays to four 
muons—hence the name of the detector. 


Once data is collected from each of the particle subdetectors, the entire 
collision event can be assessed. The energy of the ith particle is written 
Equation: 


B, = Vf (pie)® + (mie2)?”, 


where p; is the absolute magnitude of the momentum of the ith particle, and 
mM, is its rest mass. 


The total energy of all particles is therefore 


Equation: 
FEtotal = , Ej. 
i 


If all particles are detected, the total energy should be equal to the center-of- 
mass energy of the colliding beam machine (W). In practice, not all particles 
are identified, either because these particles are too difficult to detect 
(neutrinos) or because these particles “slip through.” In many cases, whole 
chains of decays can be “reconstructed,” like putting back together a watch 
that has been smashed to pieces. Information about these decay chains are 
critical to the evaluation of models of particle interactions. 


Summary 


e Many types of particle accelerators have been developed to study 
particles and their interactions. These include linear accelerators, 
cyclotrons, synchrotrons, and colliding beams. 

e Colliding beam machines are used to create massive particles that 
decay quickly to lighter particles. 

e Multipurpose detectors are used to design all aspects of high-energy 
collisions. These include detectors to measure the momentum and 
energies of charge particles and photons. 

e Charged particles are measured by bending these particles in a circle 
by a magnetic field. 

e Particles are measured using calorimeters that absorb the particles. 


Conceptual Questions 


Exercise: 
Problem: 
Briefly compare the Van de Graaff accelerator, linear accelerator, 
cyclotron, and synchrotron accelerator. 
Exercise: 
Problem: 


Describe the basic components and function of a typical colliding 
beam machine. 


Solution: 


the “linac” to accelerate the particles in a straight line, a synchrotron to 
accelerate and store the moving particles in a circular ring, and a 
detector to measure the products of the collisions 


Exercise: 


Problem: 
What are the subdetectors of the Compact Muon Solenoid experiment? 
Briefly describe them. 
Exercise: 
Problem: 


What is the advantage of a colliding-beam accelerator over one that 
fires particles into a fixed target? 


Solution: 


In a colliding beam experiment, the energy of the colliding particles 
goes into the rest mass energy of the new particle. In a fix-target 
experiment, some of this energy is lost to the momentum of the new 
particle since the center-of-mass of colliding particles is not fixed. 


Exercise: 
Problem: 


An electron appears in the muon detectors of the CMS. How is this 
possible? 


Problems 


Exercise: 
Problem: 
A charged particle in a 2.0-T magnetic field is bent in a circle of radius 
75 cm. What is the momentum of the particle? 


Exercise: 


Problem: 


A proton track passes through a magnetic field with radius of 50 cm. 
The magnetic field strength is 1.5 T. What is the total energy of the 
proton? 


Solution: 


965 GeV 
Exercise: 
Problem: 
Derive the equation p = 0.3Br using the concepts of centripetal 


acceleration (Motion in Two and Three Dimensions) and relativistic 
momentum (Relativity) 


Exercise: 
Problem: 
Assume that beam energy of an electron-positron collider is 
approximately 4.73 GeV. What is the total mass (W) of a particle 


produced in the annihilation of an electron and positron in this 
collider? What meson might be produced? 


Solution: 


According to [link], 

W = 2Ebeam = 9.46 GeV, 

M = 9.46 GeV/c’. 

This is the mass of the upsilon (1S) meson first observed at Fermi lab 
in 1977. The upsilon meson consists of a bottom quark and its 


antiparticle (06) ; 


Exercise: 


Problem: 


At full energy, protons in the 2.00-km-diameter Fermilab synchrotron 
travel at nearly the speed of light, since their energy is about 1000 
times their rest mass energy. (a) How long does it take for a proton to 
complete one trip around? (b) How many times per second will it pass 
through the target area? 


Exercise: 


Problem: 


Suppose a W ~~ created in a particle detector lives for 5.00 x 10°?°s. 
What distance does it move in this time if it is traveling at 0.900c? 
(Note that the time is longer than the given W ~ lifetime, which can be 
due to the statistical nature of decay or time dilation.) 


Solution: 


0.135 fm; Since this distance is too short to make a track, the presence 
of the W~ must be inferred from its decay products. 


Exercise: 


Problem: 


What length track does a 7* traveling at 0.100c leave in a bubble 
chamber if it is created there and lives for 2.60 x 10~°s? (Those 
moving faster or living longer may escape the detector before 
decaying.) 


Exercise: 


Problem: 
The 3.20-km-long SLAC produces a beam of 50.0-GeV electrons. If 
there are 15,000 accelerating tubes, what average voltage must be 


across the gaps between them to achieve this energy? 


Solution: 


3.33 MV 


Glossary 


particle accelerator 
machine designed to accelerate charged particles; this acceleration is 
usually achieved with strong electric fields, magnetic fields, or both 


particle detector 
detector designed to accurately measure the outcome of collisions 
created by a particle accelerator; particle detectors are hermetic and 
multipurpose 


synchrotron 
circular accelerator that uses alternating voltage and increasing 
magnetic field strengths to accelerate particles to higher and higher 
energies 


synchrotron radiation 
high-energy radiation produced in a synchrotron accelerator by the 
circular motion of a charged beam 


The Standard Model 
By the end of this section, you will be able to: 


e Describe the Standard Model in terms of the four fundamental forces and 
exchange particles 

¢ Draw a Feynman diagram for a simple particle interaction 

e Use Heisenberg’s uncertainty principle to determine the range of forces 
described by the Standard Model 

e Explain the rationale behind grand unification theories 


The chief intellectual activity of any scientist is the development and revision of 
scientific models. A particle physicist seeks to develop models of particle 
interactions. This work builds directly on work done on gravity and 
electromagnetism in the seventeenth, eighteenth, and nineteenth centuries. The 
ultimate goal of physics is a unified “theory of everything” that describes all 
particle interactions in terms of a single elegant equation and a picture. The 
equation itself might be complex, but many scientists suspect the idea behind 
the equation will make us exclaim: “How could we have missed it? It was so 
obvious!” 


In this section, we introduce the Standard Model, which is the best current 
model of particle interactions. We describe the Standard Model in detail in 
terms of electromagnetic, weak nuclear, and strong forces. At the end of this 
section, we review unification theories in particle physics. 


Introduction to the Standard Model 


The Standard Model of particle interactions contains two ideas: electroweak 
theory and quantum chromodynamics (QCD) (the force acting between color 
charges). Electroweak theory unifies the theory of quantum electrodynamics 
(QED), the modern equivalent of classical electromagnetism, and the theory of 
weak nuclear interactions. The Standard Model combines the theory of 
relativity and quantum mechanics. 


In the Standard Model, particle interactions occur through the exchange of 
bosons, the “force carriers.” For example, the electrostatic force is 
communicated between two positively charged particles by sending and 
receiving massless photons. This can occur at a theoretical infinite range. The 
result of these interactions is Coulomb repulsion (or attraction). Similarly, 


quarks bind together through the exchange of massless gluons. Leptons scatter 
off other leptons (or decay into lighter particles) through the exchange of 
massive W and Z bosons. A summary of forces as described by the Standard 
Model is given in [link]. The gravitational force, mediated by the exchange of 
massless gravitations, is added in this table for completeness but is not part of 


the Standard Model. 


Force 
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Electromagnetic 


Weak 


Gravitational 
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The Standard Model can be expressed in terms of equations and diagrams. The 
equations are complex and are usually covered in a more advanced course in 
modern physics. However, the essence of the Standard Model can be captured 
using Feynman diagrams. A Feynman diagram, invented by American 
physicist Richard Feynman (1918-1988), is a space-time diagram that describes 
how particles move and interact. Different symbols are used for different 
particles. Particle interactions in one dimension are shown as a time-position 
graph (not a position-time graph). As an example, consider the scattering of an 
electron and electron-neutrino (({link]). The electron moves toward positive 


values of x (to the right) and collides with an electron neutrino moving to the 
left. The electron exchanges a Z boson (charge zero). The electron scatters to 
the left and the neutrino scatters to the right. This exchange is not instantaneous. 
The Z boson travels from one particle to the other over a short period of time. 
The interaction of the electron and neutrino is said to occur via the weak 
nuclear force. This force cannot be explained by classical electromagnetism 
because the charge of the neutrino is zero. The weak nuclear force is discussed 
again later in this section. 


Z° boson emitted 
t from an electron 


Z° boson absorbed by 
an electron-neutrino 


In this Feynman diagram, the exchange of a 
virtual Z° carries the weak nuclear force between 
an electron and a neutrino. 


Electromagnetic Force 


According to QED, the electromagnetic force is transmitted between charged 
particles through the exchange of photons. The theory is based on three basic 
processes: An electron travels from one place to the next, emits or absorbs a 
photon, and travels from one place to another again. When two electrons 


interact, one electron emits the photon and the other receives it ({link]). Photons 
transfer energy and momentum from one electron to the other. The net result in 
this case is a repulsive force. The photons exchanged are virtual. A virtual 
particle is a particle that exists for too short a time to be observable. Since the 
photon transit time At is extremely small, Heisenberg’s uncertainty principle 
states that the uncertainty in the photon’s energy, AF, may be very large. 


Virtual 
photon 


Feynman diagram of two 
electrons interacting 
through the exchange of a 
photon. 


To estimate the range of the electromagnetic interaction, assume that the 
uncertainty on the energy is comparable to the energy of the photon itself, 
written 

Equation: 


AE x E. 


The Heisenberg uncertainly principle states that 
Equation: 


Combining these equations, we have 


Note: 
Equation: 


The energy of a photon is given by F = hf, so 
Equation: 


~~ — 
Nv 


At x ay SS a 
hf f Cc 


The distance d that the photon can move in this time is therefore 
Equation: 
» 
d=cAtsc (=) = 
Cc 


The energy of the virtual photon can be arbitrarily small, so its wavelength can 
be arbitrarily large—in principle, even infinitely large. The electromagnetic 
force is therefore a long-range force. 


Weak Nuclear Force 


The weak nuclear force is responsible for radioactive decay. The range of the 
weak nuclear force is very short (only about 10~'8 m) and like the other forces 
in the Standard Model, the weak force can be described in terms of particle 
exchange. (There is no simple function like the Coulomb force to describe these 
interactions.) The particle exchanged is one of three bosons: Wt, W_, and Z?. 


The Standard Model predicts the existence of these spin-1 particles and also 
predicts their specific masses. In combination with previous experiments, the 
mass of the charged W bosons was predicted to be 81 GeV/c? and that of the 
Z° was predicted to be 90 GeV / c”. A CERN experiment discovered particles in 
the 1980s with precisely these masses—an impressive victory for the model. 


The weak nuclear force is most frequently associated with scattering and decays 
of unstable particles to light particles. For example, neutrons decay to protons 
through the weak nuclear force. This reaction is written 

Equation: 


n—->pt+e +k, 


where n is the neutron, p is a proton, eis an electron, and 2 is a nearly 
massless electron neutrino. This process, called beta decay, is important in 
many physical processes. A Feynman diagram of beta decay is given in [link] 
(a). The neutron emits a Wand becomes a proton, then the W_ produces an 
electron and an antineutrino. This process is similar to the scattering event 
Equation: 


e +p—>n+Up, 
In this process, the proton emits a W* and is converted into a neutron (b). The 


W* then combines with the electron, forming a neutrino. Other electroweak 
interactions are considered in the exercises. 


(a) (b) 


Feynman diagram of particles interacting through the 
exchange of a W boson: (a) beta decay; (b) conversion of 
a proton into a neutron. 


The range of the weak nuclear force can be estimated with an argument similar 
to the one before. Assuming the uncertainty on the energy is comparable to the 
energy of the exchange particle by (E & me’), we have 
Equation: 

h 


At = ——. 
me? 


The maximum distance d that the exchange particle can travel (assuming it 
moves at a speed close to c) is therefore 
Equation: 
h 
d = cAt = —. 


mc 


For one of the charged vector bosons with mc? ~ 80 GeV = 1.28 x 10°°J, 
we obtain mc = 4.27 x 10-1’ J. s/m. Hence, the range of the force mediated 
by this boson is 


Equation: 


1.05 x 10°%4J-s 


pe lr x 1078 mm. 
4.27 x 10°" J-s/m 


Strong Nuclear Force 


Strong nuclear interactions describe interactions between quarks. Details of 
these interactions are described by QCD. According to this theory, quarks bind 
together by sending and receiving gluons. Just as quarks carry electric charge 
[either (+2/3)e or (—1/3)e] that determines the strength of electromagnetic 
interactions between the quarks, quarks also carry “color charge” (either red, 
blue, or green) that determines the strength of strong nuclear interactions. As 
discussed before, quarks bind together in groups in color neutral (or “white”) 
combinations, such as red-blue-green and red-antired. 


Interestingly, the gluons themselves carry color charge. Eight known gluons 
exist: six that carry a color and anticolor, and two that are color neutral ({link] 
(a)). To illustrate the interaction between quarks through the exchange of 
charged gluons, consider the Feynman diagram in part (b). As time increases, a 
red down quark moves right and a green strange quark moves left. (These 
appear at the lower edge of the graph.) The up quark exchanges a red-antigreen 
gluon with the strange quark. (Anticolors are shown as secondary colors. For 
example, antired is represented by cyan because cyan mixes with red to form 
white light.) According to QCD, all interactions in this process—identified with 
the vertices—must be color neutral. Therefore, the down quark transforms from 
red to green, and the strange quark transforms from green to red. 


Colo 
Anticolo 


a 4 


(a) (b) 


(a) Eight types of gluons carry the strong nuclear force. The white gluons 
are mixtures of color-anticolor pairs. (b) An interaction between two 
quarks through the exchange of a gluon. 


As suggested by this example, the interaction between quarks in an atomic 
nucleus can be very complicated. [link] shows the interaction between a proton 
and neutron. Notice that the proton converts into a neutron and the neutron 
converts into a proton during the interaction. The presence of quark-antiquark 
pairs in the exchange suggest that bonding between nucleons can be modeled as 
an exchange of pions. 
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A Feynman diagram that describes a strong nuclear interaction 
between a proton and a neutron. 


In practice, QCD predictions are difficult to produce. This difficulty arises from 
the inherent strength of the force and the inability to neglect terms in the 
equations. Thus, QCD calculations are often performed with the aid of 
supercomputers. The existence of gluons is supported by electron-nucleon 
scattering experiments. The estimated quark momenta implied by these 
scattering events are much smaller than we would expect without gluons 
because the gluons carry away some of the momentum of each collision. 


Unification Theories 


Physicists have long known that the strength of an interaction between particles 
depends on the distance of the interaction. For example, two positively charged 
particles experience a larger repulsive force at a short distance then at a long 
distance. In scattering experiments, the strength of an interaction depends on the 
energy of the interacting particle, since larger energy implies both closer and 
stronger interactions. 


Particle physicists now suspect that the strength of all particle interactions (the 
four forces) merge at high energies, and the details of particle interactions at 
these energies can be described in terms of a single force ([link]). A unified 
theory describes what these interactions are like and explains why this 
description breaks down at low-energy scales. A grand unified theory is a 
theory that attempts to describe strong and electroweak interaction in terms of 
just one force. A theory of everything (TOE) takes the unification concept one 
step further. A TOE combines all four fundamental forces (including gravity) 
into one theory. 
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Grand unification of forces at high energies. 


Summary 


e The Standard Model describes interactions between particles through the 
strong nuclear, electromagnetic, and weak nuclear forces. 

e Particle interactions are represented by Feynman diagrams. A Feynman 
diagram represents interactions between particles on a space-time graph. 

e Electromagnetic forces act over a long range, but strong and weak forces 
act over a short range. These forces are transmitted between particles by 
sending and receiving bosons. 

e Grand unified theories seek an understanding of the universe in terms of 
just one force. 


Conceptual Questions 


Exercise: 
Problem: 


What is the Standard Model? Express your answer in terms of the four 
fundamental forces and exchange particles. 


Solution: 


The Standard Model is a model of elementary particle interactions. This 
model contains the electroweak theory and quantum chromodynamics 
(QCD). It describes the interaction of leptons and quarks though the 
exchange of photons (electromagnetism) and bosons (weak theory), and 
the interaction of quark through the exchange of gluons (QCD). This 
model does not describe gravitational interactions. 


Exercise: 


Problem: 


Draw a Feynman diagram to represents annihilation of an electron and 
positron into a photon. 


Exercise: 


Problem: What is the motivation behind grand unification theories? 


Solution: 


To explain particle interactions that involve the strong nuclear, 
electromagnetic, and weak nuclear forces in a unified way. 


Exercise: 
Problem: 
If a theory is developed that unifies all four forces, will it still be correct to 


say that the orbit of the Moon is determined by the gravitational force? 
Explain why. 


Exercise: 
Problem: 


If the Higgs boson is discovered and found to have mass, will it be 
considered the ultimate carrier of the weak force? Explain your response. 


Solution: 


No, however it will explain why the W and Z bosons are massive (since 
the Higgs “imparts” mass to these particles), and therefore why the weak 
force is short ranged. 


Exercise: 
Problem: 
One of the common decay modes of the A° is A° — 27 + p. Even though 
only hadrons are involved in this decay, it occurs through the weak nuclear 


force. How do we know that this decay does not occur through the strong 
nuclear force? 


Problems 


Exercise: 
Problem: 
Using the Heisenberg uncertainly principle, determine the range of the 
weak force if this force is produced by the exchange of a Z boson. 


Exercise: 


Problem: 


Use the Heisenberg uncertainly principle to estimate the range of a weak 
nuclear decay involving a graviton. 


Solution: 


The graviton is massless, so like the photon is associated with a force of 
infinite range. 


Exercise: 
Problem: (a) The following decay is mediated by the electroweak force: 
pron+et+ ve. 
Draw the Feynman diagram for the decay. 
(b) The following scattering is mediated by the electroweak force: 
Vee > UePe . 


Draw the Feynman diagram for the scattering. 
Exercise: 
Problem: 


Assuming conservation of momentum, what is the energy of each + ray 


produced in the decay of a neutral pion at rest, in the reaction 7° + + 7 
i) 


Solution: 


67.5 MeV 
Exercise: 
Problem: 


What is the wavelength of a 50-GeV electron, which is produced at 
SLAC? This provides an idea of the limit to the detail it can probe. 


Exercise: 


Problem: 


The primary decay mode for the negative pion ism — w+ ae (a) What 
is the energy release in MeV in this decay? (b) Using conservation of 
momentum, how much energy does each of the decay products receive, 
given the 7” is at rest when it decays? You may assume the muon 
antineutrino is massless and has momentum p = E//c, just like a photon. 


Solution: 


a. 33.9 MeV; b. By conservation of momentum, |p,,| = |p,| = p. By 
conservation of energy, £,, = 29.8 MeV, E,, = 4.1 MeV 


Exercise: 


Problem: 


Suppose you are designing a proton decay experiment and you can detect 
50 percent of the proton decays in a tank of water. (a) How many 
kilograms of water would you need to see one decay per month, assuming 
a lifetime of 10°! y? (b) How many cubic meters of water is this? (c) If the 
actual lifetime is 10°? y, how long would you have to wait on an average 
to see a single proton decay? 


Glossary 


Feynman diagram 
space-time diagram that describes how particles move and interact 


quantum chromodynamics (QCD) 
theory that describes strong interactions between quarks 


quantum electrodynamics (QED) 
theory that describes the interaction of electrons with photons 


Standard Model 
model of particle interactions that contains the electroweak theory and 
quantum chromodynamics (QCD) 


virtual particle 
particle that exists for too short of time to be observable 


The Big Bang 
By the end of this section, you will be able to: 


e Explain the expansion of the universe in terms of a Hubble graph and 
cosmological redshift 

e Describe the analogy between cosmological expansion and an 
expanding balloon 

e Use Hubble’s law to make predictions about the measured speed of 
distant galaxies 


We have been discussing elementary particles, which are some of the 
smallest things we can study. Now we are going to examine what we know 
about the universe, which is the biggest thing we can study. The link 
between these two topics is high energy: The study of particle interactions 
requires very high energies, and the highest energies we know about existed 
during the early evolution of the universe. Some physicists think that the 
unified force theories we described in the preceding section may actually 
have governed the behavior of the universe in its earliest moments. 


Hubble’s Law 


In 1929, Edwin Hubble published one of the most important discoveries in 
modern astronomy. Hubble discovered that (1) galaxies appear to move 
away from Earth and (2) the velocity of recession (v) is proportional to the 
distance (d) of the galaxy from Earth. Both v and d can be determined using 
stellar light spectra. A best fit to the sample illustrative data is given in 
[link]. (Hubble’s original plot had a considerable scatter but a general trend 
was still evident.) 


Velocity v (km/s) 


This graph of red shift versus 
distance for galaxies shows a 
linear relationship, with larger red 
shifts at greater distances, 
implying an expanding universe. 
The slope gives an approximate 
value for the expansion rate. 
(credit: John Cub) 


The trend in the data suggests the simple proportional relationship: 


Note: 
Equation: 


O= Ad, 


where Hp = 70 km /s / Mpc is known as Hubble’s constant. (Note: 1 
Mpc is one megaparsec or one million parsecs, where one parsec is 3.26 


light-years.) This relationship, called Hubble’s law, states that distant stars 
and galaxies recede away from us at a speed of 70 km/s for every one 
megaparsec of distance from us. Hubble’s constant corresponds to the slope 
of the line in [link]. Hubble’s constant is a bit of a misnomer, because it 
varies with time. The value given here is only its value today. 


Note: 
Watch this video to learn more about the history of Hubble’s constant. 


Hubble’s law describes an average behavior of all but the closest galaxies. 
For example, a galaxy 100 Mpc away (as determined by its size and 
brightness) typically moves away from us at a speed of 

Equation: 


y= ( (70 a ) [Mpc (100 Mpc) = 7000 km/s. 


Ss. 
This speed may vary due to interactions with neighboring galaxies. 
Conversely, if a galaxy is found to be moving away from us at speed of 


100,000 km/s based on its red shift, it is at a distance 
Equation: 


d = v/Hp = (10,000 km/s)/ ((70S ) [Mpc — 143 Mpe. 


This last calculation is approximate because it assumes the expansion rate 
was the same 5 billion years ago as it is now. 


Big Bang Model 


Scientists who study the origin, evolution, and ultimate fate of the universe 
(cosmology) believe that the universe began in an explosion, called the Big 


Bang, approximately 13.7 billion years ago. This explosion was not an 
explosion of particles through space, like fireworks, but a rapid expansion 
of space itself. The distances and velocities of the outward-going stars and 
galaxies permit us to estimate when all matter in the universe was once 
together—at the beginning of time. 


Scientists often explain the Big Bang expansion using an inflated-balloon 
model ([link]). Dots marked on the surface of the balloon represent 
galaxies, and the balloon skin represents four-dimensional space-time 
(Relativity). As the balloon is inflated, every dot “sees” the other dots 
moving away. This model yields two insights. First, the expansion is 
observed by all observers in the universe, no matter where they are located. 
The “center of expansion” does not exist, so Earth does not reside at the 
“privileged” center of the expansion (see [link]). 


(a) (b) 


An analogy to the expanding universe: The dots move away from each 
other as the balloon expands; compare (a) to (b) after expansion. 


Second, as mentioned already, the Big Bang expansion is due to the 
expansion of space, not the increased separation of galaxies in ordinary 
(static) three-dimensional space. This cosmological expansion affects all 
things: dust, stars, planets, and even light. Thus, the wavelength of light (A) 
emitted by distant galaxies is “stretched” out. This makes the light appear 
“redder” (lower energy) to the observer—a phenomenon called 
cosmological redshift. Cosmological redshift is measurable only for 
galaxies farther away than 50 million light-years. 


Example: 

Calculating Speeds and Galactic Distances 
A galaxy is observed to have a redshift: 
Equation: 


y= Aobs— Acmit _ 4 


Neat 


This value indicates a galaxy moving close to the speed of light. Using the 
relativistic redshift formula (given in Relativity), determine (a) How fast is 
the galaxy receding with respect to Earth? (b) How far away is the galaxy? 
Strategy 

We need to use the relativistic Doppler formula to determine speed from 
redshift and then use Hubble’s law to find the distance from the speed. 
Solution 


a. According to the relativistic redshift formula: 


Equation: 
1 
a page le 
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where 6 = u/c. Substituting the value for z and solving for 6, we get 
8 = 0.93. This value implies that the speed of the galaxy is 
2.8 x 10°m/s. 


b. Using Hubble’s law, we can find the distance to the galaxy if we know 


its 
recession velocity: 
Equation: 
28 < 10° 
q— ae pee See = 3.8 x 10° Mpc. 
Ho 73.8 x 10°m/s per Mpc 
Significance 


Distant galaxies appear to move very rapidly away from Earth. The 
redshift of starlight from these galaxies can be used to determine the 
precise speed of recession, over 90% of the speed of light in this case. This 
motion is not due to the motion of galaxy through space but by the 
expansion of space itself. 


Note: 
Exercise: 


Problem: 
Check Your Understanding The light of a galaxy that moves away 
from us is “redshifted.” What occurs to the light of a galaxy that 


moves toward us? 


Solution: 


blueshifted 


Note: 
View this video to learn more about the cosmological expansion. 


Structure and Dynamics of the Universe 


At large scales, the universe is believed to be both isotropic and 
homogeneous. The universe is believed to isotropic because it appears to be 
the same in all directions, and homogeneous because it appears to be the 
same in all places. A universe that is isotropic and homogeneous is said to 
be smooth. The assumption of a smooth universe is supported by the 
Automated Plate Measurement Galaxy Survey conducted in the 1980s and 
1900s ({link]). However, even before these data were collected, the 
assumption of a smooth universe was used by theorists to simplify models 
of the expansion of the universe. This assumption of a smooth universe is 
sometimes called the cosmological principle. 


The Automated Plate Measurement (APM) Galaxy Survey. Over 2 
million galaxies are depicted in a region 100 degrees across centered 
toward the Milky Way’s south pole. (credit: 2WASS/T. H. Jarrett, J. 

Carpenter, & R. Hurt) 


The fate of this expanding and smooth universe is an open question. 
According to the general theory of relativity, an important way to 
characterize the state of the universe is through the space-time metric: 


Note: 
Equation: 


ds* = c’dt? — a(t)*dX”, 


where c is the speed of light, a is a scale factor (a function of time), and d& 
is the length element of the space. In spherical coordinates (r, 0, ), this 
length element can be written 

Equation: 


2 dr* 2( 702 1. ein 29,2 
dx" = 1— kr? +r (do + sin Ody ), 
— Kr 


where k is a constant with units of inverse area that describes the curvature 
of space. This constant distinguishes between open, closed, and flat 
universes: 


e k = 0 (flat universe) 
¢ k > 0 (closed universe, such as a sphere) 
e k < 0 (open universe, such as a hyperbola) 


In terms of the scale factor a, this metric also distinguishes between static, 
expanding, and shrinking universes: 


¢ a = 1 (static universe) 
e da/dt > 0 (expanding universe) 
e da/dt < 0 (shrinking universe) 


The scale factor a and the curvature k are determined from Einstein’s 
general theory of relativity. If we treat the universe as a gas of galaxies of 
density p and pressure p, and assume k = 0 (a flat universe), than the scale 
factor a is given by 

Equation: 


where G is the universal gravitational constant. (For ordinary matter, we 
expect the quantity p + 3p to be greater than zero.) If the scale factor is 
positive (a > 0), the value of the scale factor “decelerates” (d*a/dt? < 0), 
and the expansion of the universe slows down over time. If the numerator is 
less than zero (somehow, the pressure of the universe is negative), the value 
of the scale factor “accelerates,” and the expansion of the universe speeds 
up over time. According to recent cosmological data, the universe appears 
to be expanding. Many scientists explain the current state of the universe in 
terms of a very rapid expansion in the early universe. This expansion is 
called inflation. 


Summary 


e The universe is expanding like a balloon—every point is receding 
from every other point. 

e Distant galaxies move away from us at a velocity proportional to its 
distance. This rate is measured to be approximately 70 km/s/Mpc. 
Thus, the farther galaxies are from us, the greater their speeds. These 
“recessional velocities” can be measure using the Doppler shift of 
light. 

e According to current cosmological models, the universe began with 
the Big Bang approximately 13.7 billion years ago. 


Conceptual Questions 


Exercise: 


Problem: 


What is meant by cosmological expansion? Express your answer in 
terms of a Hubble graph and the red shift of distant starlight. 


Solution: 


Cosmological expansion is an expansion of space. This expansion is 
different than the explosion of a bomb where particles pass rapidly 
through space. A plot of the recessional speed of a galaxy is 
proportional to its distance. This speed is measured using the red shift 
of distant starlight. 


Exercise: 
Problem: 
Describe the balloon analogy for cosmological expansion. Explain 


why it only appears that we are at the center of expansion of the 
universe. 


Exercise: 
Problem: 
Distances to local galaxies are determined by measuring the brightness 
of stars, called Cepheid variables, that can be observed individually 
and that have absolute brightnesses at a standard distance that are well 


known. Explain how the measured brightness would vary with 
distance, as compared with the absolute brightness. 


Solution: 
With distance, the absolute brightness is the same, but the apparent 


brightness is inversely proportional to the square of its distance (or by 
Hubble’s law recessional velocity). 


Problems 


Exercise: 
Problem: 


If the speed of a distant galaxy is 0.99c, what is the distance of the 
galaxy from an Earth-bound observer? 


Solution: 


(0.99) (299792 km/s) = ((70)/ Mpc)(d), d = 4240 Mpc 
Exercise: 

Problem: 

The distance of a galaxy from our solar system is 10 Mpc. (a) What is 


the recessional velocity of the galaxy? (b) By what fraction is the 
starlight from this galaxy redshifted (that is, what is its z value)? 


Exercise: 
Problem: 


If a galaxy is 153 Mpc away from us, how fast do we expect it to be 
moving and in what direction? 


Solution: 


1.0 x 10*km/s away from us. 
Exercise: 
Problem: 
On average, how far away are galaxies that are moving away from us 
at 2.0% of the speed of light? 


Exercise: 


Problem: 


Our solar system orbits the center of the Milky Way Galaxy. Assuming 
a circular orbit 30,000 ly in radius and an orbital speed of 250 km/s, 
how many years does it take for one revolution? Note that this is 
approximate, assuming constant speed and circular orbit, but it is 
representative of the time for our system and local stars to make one 
revolution around the galaxy. 


Solution: 


2.26 x 10° y 
Exercise: 
Problem: 
(a) What is the approximate velocity relative to us of a galaxy near the 
edge of the known universe, some 10 Gly away? (b) What fraction of 


the speed of light is this? Note that we have observed galaxies moving 
away from us at greater than 0.9c. 


Exercise: 
Problem: 
(a) Calculate the approximate age of the universe from the average 
value of the Hubble constant, Hp = 20 km/s - Mly. To do this, 
calculate the time it would take to travel 0.307 Mpc at a constant 
expansion rate of 20 km/s. (b) If somehow acceleration occurs, would 


the actual age of the universe be greater or less than that found here? 
Explain. 


Solution: 


a.1.5 x 10!°y = 15 billion years; b. Greater, since if it was moving 
slower in the past it would take less more to travel the distance. 


Exercise: 


Problem: 


The Andromeda Galaxy is the closest large galaxy and is visible to the 
naked eye. Estimate its brightness relative to the Sun, assuming it has 
luminosity 10!” times that of the Sun and lies 0.613 Mpc away. 


Exercise: 
Problem: 
Show that the velocity of a star orbiting its galaxy in a circular orbit is 
inversely proportional to the square root of its orbital radius, assuming 
the mass of the stars inside its orbit acts like a single mass at the center 
of the galaxy. You may use an equation from a previous chapter to 


support your conclusion, but you must justify its use and define all 
terms used. 


Solution: 
a / GM 
r 


Glossary 


Big Bang 
rapid expansion of space that marked the beginning of the universe 


cosmology 
study of the origin, evolution, and ultimate fate of the universe 


Hubble’s constant 
constant that relates speed and distance in Hubble’s law 


Hubble’s law 
relationship between the speed and distance of stars and galaxies 


redshift 


lengthening of the wavelength of light (or reddening) due to 
cosmological expansion 


Evolution of the Early Universe 
By the end of this section, you will be able to: 


e Describe the evolution of the early universe in terms of the four 
fundamental forces 

e Use the concept of gravitational lensing to explain astronomical 
phenomena 

e Provide evidence of the Big Bang in terms of cosmic background 
radiation 

e Distinguish between dark matter and dark energy 


In the previous section, we discussed the structure and dynamics of universe. 
In particular, the universe appears to be expanding and even accelerating. But 
what was the universe like at the beginning of time? In this section, we discuss 
what evidence scientists have been able to gather about the early universe and 
its evolution to present time. 


The Early Universe 


Before the short period of cosmic inflation, cosmologists believe that all 
matter in the universe was squeezed into a space much smaller than an atom. 
Cosmologists further believe that the universe was extremely dense and hot, 
and interactions between particles were governed by a single force. In other 
words, the four fundamental forces (strong nuclear, electromagnetic, weak 
nuclear, and gravitational) merge into one at these energies (({link]). How and 
why this “unity” breaks down at lower energies is an important unsolved 
problem in physics. 
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The separation of the four fundamental forces in the early universe. 


Scientific models of the early universe are highly speculative. [link] shows a 
sketch of one possible timeline of events. 
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An approximate timeline for the evolution of the universe from the Big 
Bang to the present. 


1. Big Bang (t < 10~*°s) : The current laws of physics break down. At the 
end of the initial Big Bang event, the temperature of the universe is 
approximately T = 10°?K. 

2. Inflationary phase (t = 10~**to 10~*°s) : The universe expands 
exponentially, and gravity separates from the other forces. The universe 
cools to approximately T’ = 10?’K. 

3. Age of leptons (t = 10~*°to 10~®s) : As the universe continues to 
expand, the strong nuclear force separates from the electromagnetic and 
weak nuclear forces (or electroweak force). Soon after, the weak nuclear 
force separates from the electromagnetic force. The universe is a hot 
soup of quarks, leptons, photons, and other particles. 

4. Age of nucleons (t = 10~%to 225 s) : The universe consists of leptons 
and hadrons (such as protons, neutrons, and mesons) in thermal 
equilibrium. Pair production and pair annihilation occurs with equal ease, 
so photons remain in thermal equilibrium: 

Equation: 


y+tyee+er 
eet a eae: 
ytyon+n. 


The number of protons is approximately equal to the number of neutrons 
through interactions with neutrinos: 
Equation: 


Vetn He +p 


Vetp Oet+n. 


The temperature of the universe settles to approximately 10'4K —much 
too cool for the continued production of nucleon-antinucleon pairs. The 
numbers of protons and neutrons begin to dominate over their anti- 


particles, so proton-antiproton (pp) and neutron-antineutron (nn) 


annihilations decline. Deuterons (proton-neutron pairs) begin to form. 

5. Age of nucleosynthesis (¢ = 225s to 1000 years): As the universe 
continues to expand, deuterons react with protons and neutrons to form 
larger nuclei; these larger nuclei react with protons and neutrons to form 
still larger nuclei. At the end of this period, about 1/4 of the mass of the 
universe is helium. (This explains the current amount of helium in the 
universe.) Photons lack the energy to continue electron-positron 
production, so electrons and positrons annihilate each other to photons 
only. 

6. Age of ions (t = 1000 to 3000 years): The universe is hot enough to 
ionize any atoms formed. The universe consists of electrons, positrons, 
protons, light nuclei, and photons. 

7. Age of atoms (t = 3000 to 300,000 years): The universe cools below 
10°K and atoms form. Photons do not interact strongly with neutral 
atoms, so they “decouple” (separate) from atoms. These photons 
constitute the cosmic microwave background radiation to be discussed 
later. 


8. Age of stars and galaxies (t = 300,000 years to present): The atoms and 
particles are pulled together by gravity and form large lumps. The atoms 
and particles in stars undergo nuclear fusion reaction. 


Note: 
Watch this video to learn more about Big Bang cosmology. 


To describe the conditions of the early universe quantitatively, recall the 
relationship between the average thermal energy of particle (E) in a system of 
interacting particles and equilibrium temperature (T) of that system: 
Equation: 


Bape. 


where kg is Boltzmann’s constant. In the hot conditions of the early universe, 
particle energies were unimaginably large. 


Example: 

What Was the Average Thermal Energy of a Particle just after the Big 
Bang? 

Strategy 

The average thermal energy of a particle in a system of interacting particles 
depends on the equilibrium temperature of that system [link]. We are given 
this approximate temperature in the above timeline. 

Solution 

Cosmologists think the temperature of the universe just after the Big Bang 
was approximately T’ = 10°?K. Therefore, the average thermal energy of a 
particle would have been 

Equation: 


kpgT ~ (1074 eV/K)(10°°K) = 10% eV = 10" GeV. 


Significance 


This energy is many orders of magnitude larger than particle energies 
produced by human-made particle accelerators. Currently, these accelerators 
operate at energies less than 104 GeV. 


Note: 
Exercise: 


Problem: 


Check Your Understanding Compare the abundance of helium by 
mass 10,000 years after the Big Bang and now. 


Solution: 


about the same 


Nucleons form at energies approximately equal to the rest mass of a proton, or 
1000 MeV. The temperature corresponding to this energy is therefore 
Equation: 


1000 MeV 


= U1 2 x 108K, 
8.62 x 10!! MeV - K7! 


Temperatures of this value or higher existed within the first second of the 
early universe. A similar analysis can be done for atoms. Atoms form at an 
energy equal to the ionization energy of ground-state hydrogen (13 eV). The 
effective temperature for atom formation is therefore 

Equation: 


1 
Sy eink 
8.62 x 10°eV-K™ 


This occurs well after the four fundamental forces have separated, including 
forces necessary to bind the protons and neutrons in the nucleus (strong 


nuclear force), and bind electrons to the nucleus (electromagnetic force). 


Nucleosynthesis of Light Elements 


The relative abundances of the light elements hydrogen, helium, lithium, and 
beryllium in the universe provide key evidence for the Big Bang. The data 
suggest that much of the helium in the universe is primordial. For instance, it 
turns out that that 25% of the matter in the universe is helium, which is too 
high an abundance and cannot be explained based on the production of helium 
in stars. 


How much of the elements in the universe were created in the Big Bang? If 
you run the clock backward, the universe becomes more and more 
compressed, and hotter and hotter. Eventually, temperatures are reached that 
permit nucleosynthesis, the period of formation of nuclei, similar to what 
occurs at the core of the Sun. Big Bang nucleosynthesis is believed to have 
occurred within a few hundred seconds of the Big Bang. 


How did Big Bang nucleosynthesis occur? At first, protons and neutrons 
combined to form deuterons, 7H. The deuteron captured a neutron to form 
triton, 7H —the nucleus of the radioactive hydrogen called tritium. Deuterons 
also captured protons to make helium *He. When °H captures a proton or *He 
captures a neutron, helium 4He results. At this stage in the Big Bang, the ratio 
of protons to neutrons was about 7:1. Thus, the process of conversion to *He 
used up almost all neutrons. The process lasted about 3 minutes and almost 
25% of all the matter turned into *He, along with small percentages of 2H, °H, 
and °He. Tiny amounts of “Li and “Be were also formed. The expansion 
during this time cooled the universe enough that the nuclear reactions stopped. 
The abundances of the light nuclei 7H, “He, and “Li created after the Big 
Bang are very dependent on the matter density. 


The predicted abundances of the elements in the universe provide a stringent 
test of the Big Bang and the Big Bang nucleosynthesis. Recent experimental 
estimates of the matter density from the Wilkinson Microwave Anisotropy 
Probe (WMAP) agree with model predictions. This agreement provides 
convincing evidence of the Big Bang model. 


Cosmic Microwave Background Radiation 


According to cosmological models, the Big Bang event should have left 
behind thermal radiation called the cosmic microwave background radiation 
(CMBR). The intensity of this radiation should follow the blackbody radiation 
curve (Photons and Matter Waves). Wien’s law states that the wavelength of 
the radiation at peak intensity is 

Equation: 


\__ — 2-898 x 10-%m-K 
max ~~ T r] 


where T is temperature in kelvins. Scientists expected the expansion of the 
universe to “stretch the light,” and the temperature to be very low, so cosmic 
background radiation should be long-wavelength and low energy. 


In the 1960s, Arno Penzias and Robert Wilson of Bell Laboratories noticed 
that no matter what they did, they could not get rid of a faint background noise 
in their satellite communication system. The noise was due to radiation with 
wavelengths in the centimeter range (the microwave region). Later, this noise 
was associated with the cosmic background radiation. An intensity map of the 
cosmic background radiation appears in [link]. The thermal spectrum is 
modeled well by a blackbody curve that corresponds to a temperature 

f= 72.7%. ([link)): 


This map of the sky uses color to show fluctuations, or wrinkles, in the 
cosmic microwave background observed with the WMAP spacecraft. 
The Milky Way has been removed for clarity. Red represents higher 
temperature and higher density, whereas blue indicates lower temperature 
and density. This map does not contradict the earlier claim of smoothness 
because the largest fluctuations are only one part in one million. (credit: 
NASA/WMAP Science Team) 
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Intensity distribution of cosmic microwave background radiation. The 
model predictions (the line) agree extremely well with the experimental 
results (the dots). Frequency and brightness values are shown on a log 
axis. (credit: modification of work by George Smoot/NASA COBE 
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The formation of atoms in the early universe makes these atoms less likely to 
interact with light. Therefore, photons that belong to the cosmic background 

radiation must have separated from matter at a temperature T associated with 
1 eV (the approximate ionization energy of an atom) . The temperature of the 
universe at this point was 


Equation: 


1 
Pare ie = = 
8.617 x 105°eV/K 


According to cosmological models, the time when photons last scattered off 
charged particles was approximately 380,000 years after the Big Bang. Before 
that time, matter in the universe was in the plasma form and the photons were 
“thermalized.” 


Antimatter and Matter 


We know from direct observation that antimatter is rare. Earth and the solar 
system are nearly pure matter, and most of the universe also seems dominated 
by matter. This is proven by the lack of annihilation radiation coming to us 
from space, particularly the relative absence of 0.511-MeV 7 rays created by 
the mutual annihilation of electrons and positrons. (Antimatter in nature is 
created in particle collisions and in G* decays, but only in small amounts that 
quickly annihilate, leaving almost pure matter surviving.) 


Despite the observed dominance of matter over antimatter in the universe, the 
Standard Model of particle interactions and experimental measurement 
suggests only small differences in the ways that matter and antimatter interact. 
For example, neutral kaon decays produce only slightly more matter than 
antimatter. Yet, if through such decay, slightly more matter than antimatter 
was produced in the early universe, the rest could annihilate pair by pair, 
leaving mostly ordinary matter to form the stars and galaxies. In this way, the 
vast number of stars we observe may be only a tiny remnant of the original 
matter created in the Big Bang. 


Dark Matter and Dark Energy 


In the last two decades, new and more powerful techniques have revealed that 
the universe is filled with dark matter. This type of matter is interesting and 
important because, currently, scientists do not know what it is! However, we 
can infer its existence by the deflection of distant starlight. For example, if 
light from a distant galaxy is bent by the gravitational field of a clump of dark 
matter between us and the galaxy, it is possible that two images of the same 
galaxy can be produced ([link]). The bending of light by the gravitational field 


of matter is called gravitational lensing. In some cases, the starlight travels to 
an observer by multiple paths around the galaxy, producing a ring ([Link]). 


Based on current research, scientist know only that dark matter is cold, slow 
moving, and interacts weakly with ordinary matter. Dark matter candidates 
include neutralinos (partners of Z bosons, photons, and Higgs bosons in 
“supersymmetry theory”) and particles that circulate in tiny rings set up by 
extra spatial dimensions. 
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Light from a distant star is bent around a galaxy. Under the 
right conditions, two duplicate images of the same star can be 
seen. 


Light from a distant star is bent around a galaxy. Under the right 
conditions, we can see a ring of light instead of a single star. (credit: 
modification of work by ESA/Hubble & NASA) 


Increasingly precise astronomical measurements of the expanding universe 
also reveal the presence of a new form of energy called dark energy. This 
energy is thought to explain larger-than-expected values for the observed 
galactic redshifts for distant galaxies. These redshifts suggest that the universe 
is not only expanding, but expanding at an increasing rate. Virtually nothing is 
known about the nature and properties of dark energy. Together, dark energy 
and dark matter represent two of the most interesting and unsolved puzzles of 
modern physics. Scientists attribute 68.3% of the energy of the universe to 
dark energy, 26.8% to dark matter, and just 4.9% to the mass-energy of 
ordinary particles ( ). Given the current great mystery over the nature of 
dark matter and dark energy, Isaac Newton’s humble words are as true now as 
they were centuries ago: 


“T do not know what I may appear to the world, but to myself I seem to have 
been only like a boy playing on the sea-shore, and diverting myself in now 
and then finding a smoother pebble or a prettier shell than ordinary, whilst the 
great ocean of truth lay all undiscovered before me.” 


Dark 
energy 
68.3% 


Atoms 4.9% 


Dark matter 26.8% 


Estimated distribution of matter and energy in the universe. (credit: 
NASA/WMAP Science Team) 


Summary 


e The early universe was hot and dense. 

e The universe is isotropic and expanding. 

¢ Cosmic background radiation is evidence for the Big Bang. 

e The vast portion of the mass and energy of the universe is not well 
understood. 


Key Equations 


Momentum of 

oneal p = 0.3Br 
particle ina 

cyclotron 


Center-of- 
mass energy of 
a colliding 
beam machine 


W? = 2[E, Ey + (pyc) (poe)] + (mic?)” + (mgc?)? 


Approximate 

time for 

exchange of a Kee 
virtual particle 

between two 

other particles 


ty|> 


Hubble’s law v = Hod 


Cosmological 
space-time ds? = c?dt? — a(t)*d’ 
metric 


Conceptual Questions 


Exercise: 


Problem: 


What is meant by a “cosmological model of the early universe?” Briefly 
describe this model in terms of the four fundamental forces. 


Exercise: 


Problem: 


Describe two pieces of evidence that support the Big Bang model. 


Solution: 


The observed expansion of the universe and the cosmic background 
radiation spectrum. 


Exercise: 
Problem: 
In what sense are we, as Newton once said, “a boy playing on the sea- 


shore”? Express your answer in terms of the concepts of dark matter and 
dark energy. 


Exercise: 
Problem: 
If some unknown cause of redshift—such as light becoming “tired” from 


traveling long distances through empty space—is discovered, what effect 
would that have on cosmology? 


Solution: 


If light slow down, it takes long to reach Earth than expected. We 
conclude that the object is much closer than it really is. Thus, for every 
recessional velocity (based on the frequency of light, which we assume is 
not disturbed by the slowing), the distance is smaller than the “true” 
value, Hubble’s constant is larger than the “true” value, and the age of the 
universe is smaller than the “true” value. 


Exercise: 
Problem: 
In the past, many scientists believed the universe to be infinite. However, 
if the universe is infinite, then any line of sight should eventually fall on 


a star’s surface and the night sky should be very bright. How is this 
paradox resolved in modern cosmology? 


Additional Problems 


Exercise: 


Problem: 


Experimental results suggest that a muon decays to an electron and 
photon. How is this possible? 

Exercise: 
Problem: 


Each of the following reactions is missing a single particle. Identify the 
missing particle for each reaction. 


a)p+p—7n+? 
b)p+p—>p+A°+? 


Solution: 


ane beK eK Car fey toe" 
Exercise: 


Problem: 


Because of energy loss due to synchrotron radiation in the LHC at 
CERN, only 5.00 MeV is added to the energy of each proton during each 
revolution around the main ring. How many revolutions are needed to 
produce 7.00-TeV (7000 GeV) protons, if they are injected with an initial 
energy of 8.00 GeV? 


Exercise: 


Problem: 


A proton and an antiproton collide head-on, with each having a kinetic 
energy of 7.00 TeV (such as in the LHC at CERN). How much collision 
energy is available, taking into account the annihilation of the two 
masses? (Note that this is not significantly greater than the extremely 
relativistic kinetic energy.) 


Solution: 


14.002 TeV = 14.0 TeV 

Exercise: 
Problem: 
When an electron and positron collide at the SLAC facility, they each 
have 50.0-GeV kinetic energies. What is the total collision energy 
available, taking into account the annihilation energy? Note that the 


annihilation energy is insignificant, because the electrons are highly 
relativistic. 


Exercise: 
Problem: 
The core of a star collapses during a supernova, forming a neutron star. 
Angular momentum of the core is conserved, so the neutron star spins 
rapidly. If the initial core radius is 5.0 x 10° km and it collapses to 10.0 
km, find the neutron star’s angular velocity in revolutions per second, 


given the core’s angular velocity was originally 1 revolution per 30.0 
days. 


Solution: 


964 rev/s 


Exercise: 


Problem: 


Using the solution from the previous problem, find the increase in 
rotational kinetic energy, given the core’s mass is 1.3 times that of our 
Sun. Where does this increase in kinetic energy come from? 


Exercise: 


Problem: 


(a) What Hubble constant corresponds to an approximate age of the 
universe of 102° y? To get an approximate value, assume the expansion 
rate is constant and calculate the speed at which two galaxies must move 
apart to be separated by 1 Mly (present average galactic separation) in a 
time of 10° y. (b) Similarly, what Hubble constant corresponds to a 
universe approximately 2 x 10!° years old? 


Solution: 

a. Hy = 3" = 30km/s- Mly;b. Ho = 43m = 15km/s - Mly 
Challenge Problems 
Exercise: 

Problem: 


Electrons and positrons are collided in a circular accelerator. Derive the 
expression for the center-of-mass energy of the particle. 


Exercise: 


Problem: 


The intensity of cosmic ray radiation decreases rapidly with increasing 
energy, but there are occasionally extremely energetic cosmic rays that 
create a shower of radiation from all the particles they create by striking a 
nucleus in the atmosphere. Suppose a cosmic ray particle having an 
energy of 10!° GeV converts its energy into particles with masses 
averaging 200MeV/c’. 


(a) How many particles are created? (b) If the particles rain down on a 
1.00-km? area, how many particles are there per square meter? 


Solution: 


a.5 x 10°; b. divide the number of particles by the area they hit: 
5 x 10*particles/m? 


Exercise: 


Problem: 


Seaese3 : a 1 : 
(a) Calculate the relativistic quantity ~ = Vive for 1.00-TeV protons 
produced at Fermilab. (b) If such a proton created a 7* having the same 
speed, how long would its life be in the laboratory? (c) How far could it 


travel in this time? 
Exercise: 


Problem: 


Plans for an accelerator that produces a secondary beam of K mesons to 
scatter from nuclei, for the purpose of studying the strong force, call for 
them to have a kinetic energy of 500 MeV. (a) What would the relativistic 
quantity y = a be for these particles? (b) How long would their 
average lifetime be in the laboratory? (c) How far could they travel in 
this time? 


Solution: 


a. 2.01; b. 2.50 x 1078s; c.6.50m 


Exercise: 


Problem: 


In supernovae, neutrinos are produced in huge amounts. They were 
detected from the 1987A supernova in the Magellanic Cloud, which is 
about 120,000 light-years away from Earth (relatively close to our Milky 
Way Galaxy). If neutrinos have a mass, they cannot travel at the speed of 
light, but if their mass is small, their velocity would be almost that of 
light. (a) Suppose a neutrino with a 7-eV/c? mass has a kinetic energy of 
700 keV. Find the relativistic quantity y = Twa Fe for it. (b) If the 


neutrino leaves the 1987A supermova at the same time as a photon and 
both travel to Earth, how much sooner does the photon arrive? This is not 
a large time difference, given that it is impossible to know which 
neutrino left with which photon and the poor efficiency of the neutrino 
detectors. Thus, the fact that neutrinos were observed within hours of the 
brightening of the supernova only places an upper limit on the neutrino’s 
mass. (Hint: You may need to use a series expansion to find v for the 
neutrino, since its ¥y is so large.) 


Exercise: 


Problem: 


Assuming a circular orbit for the Sun about the center of the Milky Way 
Galaxy, calculate its orbital speed using the following information: The 
mass of the galaxy is equivalent to a single mass 1.5 x 101! times that 
of the Sun (or 3 x 104! kg), located 30,000 ly away. 


Solution: 
mv _ GMm 
r Tz, 2 => 
= GM \1/2 _ (6.67 x 10-1! N-m?/kg?) (3 x 10"! kg) 7 : 
——— ( A ) (30,000 ly) (9.46 x 10" m/ly) = 27 10" mis 


Exercise: 


Problem: 


(a) What is the approximate force of gravity on a 70-kg person due to the 
Andromeda Galaxy, assuming its total mass is 102° that of our Sun and 
acts like a single mass 0.613 Mpc away? (b) What is the ratio of this 
force to the person’s weight? Note that Andromeda is the closest large 
galaxy. 


Exercise: 


Problem: 


(a) A particle and its antiparticle are at rest relative to an observer and 
annihilate (completely destroying both masses), creating two + rays of 
equal energy. What is the characteristic y-ray energy you would look for 
if searching for evidence of proton-antiproton annihilation? (The fact that 
such radiation is rarely observed is evidence that there is very little 
antimatter in the universe.) (b) How does this compare with the 0.511- 
MeV energy associated with electron-positron annihilation? 


Solution: 


a. 938.27 MeV: b. 1.84 x 10° 
Exercise: 


Problem: 


The peak intensity of the CMBR occurs at a wavelength of 1.1 mm. (a) 
What is the energy in eV of a 1.1-mm photon? (b) There are 
approximately 10° photons for each massive particle in deep space. 
Calculate the energy of 10° such photons. (c) If the average massive 
particle in space has a mass half that of a proton, what energy would be 
created by converting its mass to energy? (d) Does this imply that space 
is “matter dominated”? Explain briefly. 


Exercise: 


Problem: 


(a) Use the Heisenberg uncertainty principle to calculate the uncertainty 
in energy for a corresponding time interval of 10~*%s. (b) Compare this 
energy with the 10'? GeV unification-of-forces energy and discuss why 
they are similar. 


Solution: 


a. 3.29 x 10'®GeV = 3 x 10!8 GeV: b. 0.3; Unification of the three 
forces breaks down shortly after the separation of gravity from the 
unification force (near the Planck time interval). The uncertainty in time 
then becomes greater. Hence the energy available becomes less than the 
needed unification energy. 


Glossary 


cosmic microwave background radiation (CMBR) 
thermal radiation produced by the Big Bang event 


dark energy 
form of energy believed to be responsible for the observed acceleration 
of the universe 


dark matter 
matter in the universe that does not interact with other particles but that 
can be inferred by deflection of distance star light 


nucleosynthesis 
creation of heavy elements, occurring during the Big Bang 


Units 


Quantity 
Acceleration 


Amount of 
substance 


Angle 


Angular 
acceleration 


Angular 
frequency 


Angular 
momentum 


Angular 
velocity 


Area 


Atomic number 


Capacitance 


Charge 


Charge density: 


Line 


Surface 


Volume 


Common 
Symbol 


1 


q, Q, e 


mole 
radian (rad) 


rad/s? 


rad/s 


kg -m?/s 


rad/s 


farad (F) 


coulomb (C) 


C/m 
C/m2 


C/m? 


Unit in Terms of 


Base SI Units 


kg -m?/s 


Quantity 
Conductivity 
Current 
Current density 
Density 


Dielectric 
constant 


Electric dipole 
moment 


Electric field 
Electric flux 


Electromotive 
force 


Energy 

Entropy 

Force 

Frequency 

Heat 

Inductance 

Length: 
Displacement 
Distance 


Position 


E,U,K 


Unit 
1/Q-m 
ampere 
A/m? 


kg/m? 


C-m 


N/C 


N-m?/C 
volt (V) 


joule (J) 
J/K 
newton (N) 
hertz (Hz) 
joule (J) 
henry (H) 


meter 


Unit in Terms of 
Base SI Units 


A’. s3/kg-m? 


Quantity 


Magnetic 
dipole moment 


Magnetic field 


Magnetic flux 


Mass 


Molar specific 
heat 


Moment of 
inertia 


Momentum 
Period 


Permeability of 
free space 


Permittivity of 
free space 


Potential 


Power 


Pressure 


Resistance 
Specific heat 
Speed 
Temperature 


Time 


Common 
Symbol 


SO! 


Lo 


E0 


Unit 


N-J/T 


tesla (T) = (Wb/m”) 


weber (Wb) 


kilogram 
J/mol-K 


kg - m? 


kg- m/s 


S 


N/A*= (H/m) 


C?/N -m?= (F/m) 


volt (V) = (J/C) 


watt (W) = (J/s) 


pascal (Pa) = (N/m”) 


ohm (Q) = (V/A) 
J/kg -K 

m/s 

kelvin 


second 


Unit in Terms of 
Base SI Units 


Common Unit in Terms of 


Quantity Symbol Unit Base SI Units 
Torque e N-m kg - m?/s? 
Velocity Vv m/s m/s 

Volume V m? m? 
Wavelength aN m m 

Work W joule (J) = (N- m) kg - m?/s? 


Units Used in Physics (Fundamental units in bold) 


Conversion Factors 


1 meter 

1 centimeter 
1 kilometer 
1 inch 

1 foot 

1 mile 

1 angstrom 
1 fermi 


1 light-year 


1 meter 

1 centimeter 
1 kilometer 
1 inch 

1 foot 

1 mile 


Length 
Area 


1 cm? = 0.155 in.” 


2.540 x 10-2 


0.3048 
1609 
19710 


19°15 


in. 

39.37 

0.3937 

3.937 x 104 
1 

12 


6.336 x 104 


1 m? = 10‘ cm? = 10.76 ft? 


1 in.? = 6.452 cm? 


1 ft? = 144 in.” = 0.0929 m? 


Volume 


2.540 
30.48 


1.609 x 104 


ft 


3.281 


3.281 x 10°? 


3.281 x 10° 


8.333 x 10°? 


1 


5280 


1 liter = 1000 cm? = 107? m3 = 0.03531 ft? = 61.02 in.® 


3.048 x 1074 


1.609 


9.460 x 10” 


6.214 x 10-4 


6.214 x 10°° 


1.578 x 10-5 


1.894 x 1074 


1 ft? = 0.02832 m? = 28.32 liters = 7.477 gallons 


1 gallon = 3.788 liters 


1 
second 


1 
minute 


1 hour 
1 day 
1 year 


Time 


60 


3600 
8.640 x 104 


3.156 x 10’ 


1 meter/second 


1 centimeter/second 


1 foot/second 


1 mile/hour 


Speed 


Acceleration 


min 


1.667 x 10°? 


60 
1440 


5.259 x 10° 


1 m/s” = 100 cm/s” = 3.281 ft/s” 


1 cm/s” = 0.01 m/s” = 0.03281 ft/s” 


1 ft/s” = 0.3048 m/s” = 30.48 cm/s” 


1 mi/h- s = 1.467 ft/s” 


h day yr 


2.778 x 10-4 1.157 x 10-5 3.169 x 10°8 


1.667 x 10°? 6.944 x 10-4 1.901 x 10°° 


1 4.167 x 10°? 1.141 x 107+ 
24 1 2.738 x 107-3 
8.766 x 10° 365.25 1 

cm/s ft/s mi/h 

10° 3.281 0,037 

i! 3.281 x 10° 2.237 x 10°? 

30.48 1 0.6818 

44.70 1.467 1 


kg g slug u 
1 kilogram 1 10° 6.852 x 10°? 6.024 x 1076 
1 gram 10° 1 6.852 x 10° 6.024 x 107° 
1 slug 14.59 1.459 x 10+ 1 8.789 x 102” 
1 atomic mass unit 1.661 x 10-2” 1.661 x 10-4 1.138 x 10°78 1 
1 metric ton 1000 
Mass 
N dyne Ib 
1 newton 1 10° 0.2248 
1 dyne 10° 1 2.248 x 10-6 
1 pound 4.448 4.448 x 10° il 
Force 
Pa dyne/cm? atm cmHg Ib/in.” 
1 pascal 1 10 9.869 x 10°° 7.501 x 10-4 =1.450 x 1 
aauesiaee 10 1 9.869 x 10°’ ~—_7.501 x 10°° 1.450 x 1 
1 atmosphere 1.013 x 10° 1.013 x 10° d. 76 14.70 
eae 1.333 x 10? 1.333 x 101.316 x 102 91 0.1934 
1 pound/inch? 6.895 x 10° 6.895 x 10* = 6.805 x 10°?_—s 5.171 1 
1 bar 10° 


1 torr 1 (mmHg) 
*Where the acceleration due to gravity is 9.80665 m/ s” and the temperature is 0°C 


Pressure 


J erg 
1 joule 1 10’ 
1 erg 10.7 1 
1 foot-pound 1.356 1.356 x 10’ 
1 electron-volt 1.602 x 10°19 1.602 x 10°” 
1 calorie 4.186 4.186 x 10° 
1 British thermal unit 1.055 x 10° 1.055 x 107° 
1 kilowatt-hour 3.600 x 10° 

eV cal 
1 joule 6.242 x 108 0.2389 
1 erg 6.242 x 10" 2.389 x 10°° 
1 foot-pound 8.464 x 10'8 0.3239 
1 electron-volt 1 3.827 x 10°7° 
1 calorie 2.613 x 10! 1 
1 British thermal unit 6.585 x 1071 2.520 x 10? 


Work, Energy, Heat 

Power 

1W=14J/s 

1Lhp = 746 W = 550ft - lb/s 
1 Btu/h = 0.293 W 

Angle 

1 rad = 57.30° = 180° /n 

1° = 0.01745 rad = 1/180 rad 
1 revolution = 360° = 2nrad 


1 rev/min (rpm) = 0.1047 rad/s 


10-8 


190719 


10? 


10-4 
1o7ll 
10°° 
10722 


107° 


Fundamental Constants 


Quantity Symbol Value 
Atomic mass 1.660 538 782 (83) x 10°?’ kg 
u 
unit 931.494 028 (23) MeV/c? 
Avogadro’s 8 ed 3 -1 
Na 6.02214076 x 10°° reciprocal mole(mol ) 
number 
Pou yy = 9.274 009 15 (23) x 10-43 /T 
magneton 2me 
Bohr radius ag = —*>- 5.291 772 085 9 (36) x 10m 
Boltzmann’s _ oR 93+ : 4 
kp =} : : 
Rae B= Hy, 1.380649 x 108 joule per kelvin(J - K"') 
Pompton Ao = 2.426 310 2175 (33) x 107!?m 
wavelength is 
Coulomb jy So 9 22 
eee 8.987 551 788... x 10° N-m*/C’ (exact) 
constant 0 


3.343 583 20 (17) x 10°77’ kg 
eri ma 2.013 553 212 724 (78) u 
1875.612 859 MeV/c? 


9.109 382 15 (45) x 10-2!kg 
ee Me 5.485 799 0943 (23) x 10-4u 
0.510 998 910 (13) MeV/c? 


Electron volt eV 1.602 176 487 (40) x 10°19J 


Quantity Symbol Value 
Eonenny le 1.602176634 x 107°C 
charge 
Gas constant R 8.314 472 (15) J/mol- K 
Gravitational “4 An 2 
Reem G 6.674 28 (67) x 10°“ N-m*/kg 
1.674 927 211 (84) x 10°?" kg 
a Mn 1.008 664 915 97 (43) u 
939.565 346 (23) MeV/c? 
Bue ae 5.050 783 24 (13) x 10-275 /T 
magneton P 
Permeability Im. 
of free Space [40 An x 10°‘ T- m/A (exact) 
Penmitvity gg = 4 8.854 187 817... x 10-!2C2/N - m2 (exact) 
of free space Po 
Planck’s h 6.62607015 x 10°-“*kg- m?-s1 
constant A= 3 1.05457182 x 10-4kg-m?-s7 
1.672 621 637 (83) x 10°?’ kg 
Proton mass Mp 1.007 276 466 77 (10) u 
938.272 013 (23) MeV/c? 
Bycbere Ry 1.097 373 156 852 7 (73) x 107m! 
constant 
Speed of 
light in c 2.997 92458 x 10°m/s (exact) 
vacuum 


Fundamental ConstantsNote: These constants are the values recommended in 2006 by 
CODATA, based on a least-squares adjustment of data from different measurements. The 
numbers in parentheses for the values represent the uncertainties of the last two digits. 


Useful combinations of constants for calculations: 

he = 12,400 eV - A = 1240 eV - nm = 1240 MeV: fm 
fc = 1973 eV - A = 197.3 eV - nm = 197.3 MeV - fm 
kee? = 14.40eV-A=1.440eV- nm = 1.440 MeV- fm 


kpT = 0.02585 eV at T = 300K 


Astronomical Data 


Celestial 
Object 


Sun 
Mercury 


Venus 


Earth 


Mars 


Jupiter 


Saturn 


Uranus 


Neptune 


Mean 
Distance 
from Sun 
(million 
km) 


227.9 


778.4 


1426.7 


2871.0 


4498.3 


Period of 
Revolution 
(d = days) 
(y = years) 


88 d 


224.7 d 


365.26 d 


687 d 


11.9 y 


29.5 6 


84.0 y 


164.8 y 


Period 
of 
Rotation 
at 
Equator 


27d 
59 d 
243 d 


23h 56 
min4s 


24h 37 
min 235 


9h50 
min 30s 


10h 14 
min 


17h 14 
min 


16h 


Eccentricity 


of Orbit 


0.093 


0.048 


0.054 


0.047 


0.009 


Celestial 
Object 


Earth’s 
Moon 


Celestial 
Object 


Sun 
Mercury 
Venus 
Earth 
Mars 
Jupiter 
Saturn 
Uranus 
Neptune 


Earth’s 
Moon 


Mean 
Distance 
from Sun 
(million 
km) 
149.6 
(0.386 
from 
Earth) 
Equatorial 
Diameter 
(km) 
1,392,000 
4879 
12,104 
12,756 
6794 
142,984 
120,536 
51,118 


49,528 


3476 


Astronomical Data 


Other Data: 


Period of 
Revolution 
(d = days) 
(y = years) 


27.3d 


333,000.00 
0.06 

0.82 

1.00 

0.11 
317.83 
95.16 
14.54 


17.15 


0.01 


Period 
of 
Rotation 
at 
Equator 


27.3d 


Density 
(g/cm?) 


1.4 
3.4 
5.2 
9.0 
3.9 
1.3 
0.7 
1.3 


1.6 


3.3 


Eccentricity 


of Orbit 


0.055 


Mass of Earth: 5.97 x 10%kg 
Mass of the Moon: 7.36 x 10??kg 


Mass of the Sun: 1.99 x 10°? kg 


Mathematical Formulas 


Quadratic formula 


If ax? + br +c =0, then x = 


Triangle of base b and 
height h 


Circle of radius r 
Sphere of radius r 


Cylinder of radius r and 
height h 


Geometry 
Trigonometry 


Trigonometric Identities 


.sin 6 = 1/csc 0 
.cos 0 = 1/sec 6 

. tan 0 = 1/cot 0 

.sin (90° — 6) = cos 0 
. COS (90° — 6) = sin 0 
. tan (90° — 6) = cot? 
.sin? 9+ cos?6=1 
.sec? §@—tan?9=1 


CON DU BRWNY RP 


—b+/b2—4ac 
2a 
Area = +bh 


2 


Circumference = 27r 


Surface area = 47? 


Area of curved surface 
= 2rrh 


Area = 7r 


Volume 
ats, AL 23 
= 3ar 


Volume 
= arr*h 


2 


9. tan 0 = sin 0/cos 0 
10. sin (a + 8) = sinacos 8 + cosasin B 
11. cos(a + 8) = cosacos 8 F sina sin B 


t af 
12. tan (a + 8) = FS 


13. sin 20 = 2sin 6 cos 0 

14. cos 20 = cos” 6 — sin? 6 = 2cos* 6 —1=1-— 2sin? 6 
15. sina + sin 6 = 2sin 7 (a + B)cos 7 (0 — B) 

16. cos a + cos B = 2cos5(a + B)cos+(a — £) 


Triangles 


: . a _ b == c 
1. Law of sines: >— = snf — siny 


2. Law of cosines: c? = a? + b? — 2abcos y 


Cc 


3. Pythagorean theorem: a” + b? = c? 


a [IN 


Series expansions 


1. Binomial theorem: 


(a+ b)” =a"+na" Ip, Ae ar 


n(n—1)(n—2)a"—3b 


3! 


2.(l+2)" Hine 4 MOM. (a? <1) 
3.(lt2)" =15 9¢ + SF (2? <1) 


fhe =e Sat 
5.cosz = 1— 2 es = 


6.tanz=2+2% ptt 
Pe ~i4te+24- 


Rd ea a= 5a Pew Se (la) 1) 


Derivatives 
1. laf (2)| = aff (2) 
2. de lf (x) + 9(2)] = ef (2) + #92) 
3. If (x)g(x)| = f(x) $a (2) + 9(2) 4 
4. f(u) = (alle 
Bi £ =ma™! 
6. — sill tT = cos 7 


Es 4. cos x = —sinz 
d. 2 
8. aa tan x = sec’ x 
9. --- cot x = —csc? x 
6 bg 
10. + sec xz = tanzsecz 
11. 4. csc 2 = —cot x csc z 
12 tet =e 
XL 
13. £ Ing = + 
ce ee i 
14 7, Sn © = = 
1 1 
15 Gz COS “X= — SS 
de ee 
16. = tan 2 = ea 
Integrals 


1. faf(a\dx =a | F(v)da 
Aa (@)lde = | F(w)de + [ 9(2)de 


gmt 


= el (m a —1) 


Bs 
= Inaz(m = -1) 


4. | sinz dz = —cosz 


5. | cosxzdz =sinz 


“x  sin2azr 


7. | sin? ax dx = 7 


4a 
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Periodic Table of the Elements 
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promethium || samarium europlum gadolinium terbium dysprosium 
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Neroviurm | |unumpentum| | livermorium | jununseptum) | ununoctium 
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Er | Tm | Yb || Lu 
167.3 || 1689 || 173.1 |] 175.0 
erbium thutum tterblum lutetiuemn 
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Atomic 


number symbol 


Atomic mass 
Name 


O 


Lo 


a 


Color Code 


Metal 
Metalloid 
Nonmetal 


Solid 
Liquid 
Gas 


The Greek Alphabet 


Name Capital Lowercase Name Capital Lowercase 
Alpha A a Nu N V 
Beta B B Xi = € 
Gamma Tr Y Omicron O O 
Delta A 6 Pi II as 
Epsilon Ie € Rho P p 
Zeta Z ¢ Sigma yu o 
Eta H n Tau T 7 
Theta 3) 0 Upsilon Y ) 
lota I L Phi ® p 
Kappa K K Chi Xx 

Lambda A r Psi wU w 
Mu M Lb Omega Q, ey) 


The Greek Alphabet 


